WORLD INTELLECTUAL PROPERTY ORGANIZATION
International Bureau

PCT

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT)



(51) International Patent Classification: C07K 14/38, C12N 15/80, 1/15    A1

(11) International Publication Number: WO 98/01470

(43) International Publication Date: 15 January 1998 (15.01.98)

(21) International Application Number: PCT/DK97/00305

(22) International Filing Date: 7 July 1997 (07.07.97)



(30) Priority Data:
074W96    5 July 1996 (05.07.96)    DK



(71) Applicant (for all designated States except US): NOVO
NORDISK A/S [DK/DK]; Novo Allé, DK-2880 Bagsværd (DK).

(72) Inventor; and

(75) Inventor/Applicant (for US only): CHRISTENSEN, Tove
[DK/DK]; Novo Nordisk A/S, Novo Allé, DK-2880
Bagsværd (DK).

(74) Common Representative: NOVO NORDISK A/S; Novo Allé,
DK-2880 Bagsværd (DK).



(81) Designated States: AL, AM, AT, AU, AZ, BA, BB, BG, BR,
BY, CA, CH, CN, CU, CZ, DE, DK, EE, ES, FI, GB, GE,
GH, HU, IL, IS, JP, KE, KG, KP, KR, KZ, LC, LK, LR,
LS, LT, LU, LV, MD, MG, MK, MN, MW, MX, NO, NZ,
PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, TJ, TM, TR,
TT, UA, UG, US, UZ, VN, YU, ZW, ARIPO patent (GH,
KE, LS, MW, SD, SZ, UG, ZW), Eurasian patent (AM, AZ,
BY, KG, KZ, MD, RU, TJ, TM), European patent (AT, BE,
CH, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL,
PT, SE), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, GN,
ML, MR, NE, SN, TD, TG).



Published

With international search report.

Before the expiration of the time limit for amending the
claims and to be republished in the event of the receipt of
amendments.



(54) Title: A TRANSCRIPTION FACTOR 
(57) Abstract 



A transcription factor regulating α-amylase promoter initiated expression in filamentous fungi, especially in Aspergilli, DNA sequences
coding for said factor, its transformation into and expression in fungal host organisms, and the use of said factor in such hosts for increasing
the expression of a polypeptide of interest being produced by said host.



WO 98/01470    PCT/DK97/00305

Title: A Transcription Factor 

FIELD OF THE INVENTION

The present invention relates to a transcription factor found in
filamentous fungi, especially in Aspergilli, DNA sequences
coding for said factor, its transformation into and expression
in fungal host organisms, and the use of said factor in such
hosts for increasing the expression of a polypeptide of interest
being produced by said host.

BACKGROUND OF THE INVENTION

Transcription factors are well known proteins involved in the
initiation of transcription. They have been studied intensively
in many different organisms and have also been described in
fungi. Dhawale and Lane (NAR (1993) 21 5537-5546) have recently
compiled the transcription factors from fungi, including the
filamentous fungi.

Many of the transcription factors are regulatory proteins; they
bind to the promoter DNA and either activate or repress
transcription as a response to stimuli to the cell.

The expression of the α-amylase gene in A. oryzae is regulated
in response to the available carbon source. The gene is
expressed at its maximum when the organism is grown on starch or
maltose (Lachmund et al. (1993) Current Microbiology 26 47-51;
Tada et al. (1991) Mol. Gen. Genet. 229 301-306). The expression
of α-amylase is regulated at the transcriptional level as shown
by Lachmund et al. (supra), which strongly suggests that
transcription factors are involved in the regulation, but so far
no gene for such a factor has been identified.

The promoter of the α-amylase gene has been studied by deletion
analysis (Tada et al. (1991) Agric. Biol. Chem. 55 1939-1941;
Tsuchiya et al. (1992) Biosci. Biotech. Biochem. 56 1849-1853;
Nagata et al. (1993) Mol. Gen. Genet. 237 251-260). The authors
of these papers propose that a specific sequence of the promoter
is responsible for the maltose induction. Nagata et al. (supra)
used this sequence as a probe in a gel shift experiment to see
whether any proteins from A. nidulans nuclear extracts were able
to bind to the promoter sequence. Three such proteins were
found, but no involvement of these proteins in expression was
shown. None of the proteins has been purified or identified by
other means. Their genes likewise remain unknown.

SUMMARY OF THE INVENTION

The present invention relates to a transcription factor regulating
the expression of the α-amylase promoter in filamentous
fungi.

Accordingly, in a first aspect the invention relates to a DNA
construct comprising a DNA sequence encoding a transcription
factor of the invention, which DNA sequence comprises

a) the transcription factor encoding part of the DNA sequence
cloned into plasmid pToC320 present in E. coli ToC1058, DSM
10666, or

b) an analogue of the DNA sequence defined in a), which

i) is at least 60% homologous with the DNA sequence defined
in a), or

ii) hybridizes with the same nucleotide probe as the DNA
sequence defined in a), or

iii) encodes a transcription factor which is at least 50%
homologous with the transcription factor encoded by a DNA
sequence comprising the DNA sequence defined in a), or

iv) encodes a transcription factor which is immunologically
reactive with an antibody raised against the purified
transcription factor encoded by the DNA sequence defined
in a), or

v) complements the mutation in ToC879, i.e. enables ToC879 to
grow on cyclodextrin and produce lipase when transformed
with said DNA sequence.

The full length genomic DNA sequence encoding a transcription
factor has been derived from a strain of the filamentous fungus
Aspergillus oryzae and has been cloned into plasmid pToC320
present in E. coli ToC1058, DSM 10666.

Said transcription factor encoding DNA sequence harboured in
pToC320, DSM 10666, is believed to have the same sequence as
that presented in SEQ ID NO: 1 and SEQ ID NO: 2. Accordingly,
whenever reference is made to the transcription factor encoding
part of the DNA sequence cloned into plasmid pToC320 present in
DSM 10666 such reference is also intended to include the
transcription factor encoding part of the DNA sequence presented
in SEQ ID NO: 1 and SEQ ID NO: 2.

Accordingly, the terms "the transcription factor encoding part
of the DNA sequence cloned into plasmid pToC320 present in DSM
10666" and "the transcription factor encoding part of the DNA
sequence presented in SEQ ID NO: 1 and SEQ ID NO: 2" may be used
interchangeably.

In further aspects the invention provides an expression vector
harbouring the DNA construct of the invention, a cell comprising
said DNA construct or said expression vector and a method of
producing a peptide exhibiting transcription factor activity,
which method comprises culturing said cell under conditions
permitting the production of the transcription factor.

Such a transcription factor of the invention will typically
originate from a filamentous fungus.

The term "filamentous fungus" is intended to include the groups
Phycomycetes, Zygomycetes, Ascomycetes, Basidiomycetes and
Fungi imperfecti, including Hyphomycetes such as the genera
Aspergillus, Penicillium, Trichoderma, Fusarium and Humicola.

The invention also relates to a method of producing a filamentous
fungal host cell comprising the introduction of a DNA
fragment coding for any such factor into a filamentous fungus
wherein an α-amylase promoter or a co-regulated promoter
regulates the expression of a polypeptide of interest in a
manner whereby said factor will be expressed in said fungus.

In a further aspect the invention relates to a method of producing
a polypeptide of interest, the expression of which is
regulated by an α-amylase promoter or a co-regulated promoter,
comprising growing a filamentous fungal host cell as described
above under conditions conducive to the production of said
factor and said polypeptide of interest, and recovering said
polypeptide of interest.

Finally the invention relates to the use of said factor for
regulating the expression of a polypeptide of interest in a
filamentous fungus.

In this context, regulation means changing the conditions under
which the factor of the invention is active. This could mean
different pH, substrate, etc. regimes, whereby the resulting
effect is an improved regulation of the expression of the
protein of interest.

Furthermore, regulation also comprises events occurring in the
growth phase of the fungus during which the transcription factor
is active. Depending on the circumstances, either advancing or
postponing the phase wherein the factor is active may enhance
the expression and thus the yield.

In addition, using standard procedures known in the art, the
specific DNA sequences involved in the binding of a
transcription factor may be identified, thereby making it
possible to insert such sequences into other promoters not
normally regulated by the factor and enabling those promoters to
be under the regulation of said factor.



BRIEF DESCRIPTION OF THE TABLES AND DRAWINGS

In the figures

Fig. 1 shows the structure of the plasmid pMT1657, the
construction of which is described in Example 1;

Fig. 2 shows the structure of the plasmid pToC316, the
construction of which is described in Example 1;

Fig. 3 shows the structure of the plasmid pToC320, the
construction of which is described in Example 1;

Fig. 4 shows the structure of the plasmids pToC342 and pToC359,
the construction of which is described in Example 3;

Fig. 5 shows the structure of the plasmid pToC298, the
construction of which is described in Example 4;

Fig. 6 shows the results of lipase production by a p960
transformant of A. oryzae IFO 4177 cultured in YP media
containing 2% glucose ( — ■ — ) or 10% glucose ( — ♦ — ), in
comparison to ToC1075 cultured in YP media containing 2% glucose
( — □ — ) or 10% glucose ( — 0 — ) and described in Example 4;

Fig. 7 shows the results of lipase production by ToC1139
cultured in YP media containing 2% glucose ( — ■ — ) or 10%
glucose ( — ♦ — ), in comparison to ToC1075 cultured in YP media
containing 2% glucose ( — □ — ) or 10% glucose ( — 0 — ) and
described in Example 4; and

Fig. 8 shows the autoradiograph results of A. niger DNA digested
with the following restriction enzymes: lane 2, XbaI; lane 3,
KpnI; lane 4, SalI; lane 5, HindIII; lane 6, EcoRI; lane 7,
BglII; lane 8, BamHI; lanes 1 and 9 contain ³²P-labelled λ DNA
digested with BstEII. The experiment is described in Example 5.

DETAILED DESCRIPTION OF THE INVENTION

In a first aspect the invention relates to a DNA construct
comprising a DNA sequence encoding a transcription factor
regulating an α-amylase promoter, which DNA sequence comprises

a) the transcription factor encoding part of the DNA sequence
cloned into plasmid pToC320 present in E. coli ToC1058, DSM
10666, or

b) an analogue of the DNA sequence defined in a), which

i) is at least 60% homologous with the DNA sequence
defined in a), or

ii) hybridizes with the same nucleotide probe as the
DNA sequence defined in a), or

iii) encodes a transcription factor which is at least
50% homologous with the transcription factor
encoded by a DNA sequence comprising the DNA
sequence defined in a), or

iv) encodes a transcription factor which is
immunologically reactive with an antibody raised
against the purified transcription factor encoded
by the DNA sequence defined in a), or

v) complements the mutation in ToC879, i.e. enables
ToC879 to grow on cyclodextrin and produce lipase
when transformed with said DNA sequence.

As defined herein, a DNA sequence analogous to the transcription
factor encoding part of the DNA sequence cloned into plasmid
pToC320 present in E. coli ToC1058, DSM 10666, is intended to
indicate any DNA sequence encoding a transcription factor
regulating an α-amylase promoter, which transcription factor has
one or more of the properties cited under (i) - (v) above.

The analogous DNA sequence may be isolated from a strain of the
filamentous fungus A. oryzae producing the transcription factor,
or from another or related organism, and thus, e.g., be an allelic or
species variant of the transcription factor encoding part of the
DNA sequence cloned into plasmid pToC320 present in DSM 10666.

Alternatively, the analogous sequence may be constructed on the
basis of the DNA sequence presented as the transcription factor
encoding part of SEQ ID NO: 1 and SEQ ID NO: 2, e.g. be a
subsequence thereof, and/or by introduction of nucleotide
substitutions which do not give rise to another amino acid sequence
of the transcription factor encoded by the DNA sequence, but
which correspond to the codon usage of the host organism
intended for production of the transcription factor, or by
introduction of nucleotide substitutions which may give rise to
a different amino acid sequence.
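
The distinction drawn above, between nucleotide substitutions that leave the encoded amino acid sequence unchanged and those that alter it, can be illustrated with a minimal sketch. It assumes the standard genetic code and uses hypothetical helper names (`translate`, `is_silent`) and toy sequences; nothing in it is taken from SEQ ID NO: 1 or SEQ ID NO: 2.

```python
from itertools import product

# Standard genetic code in the conventional T, C, A, G ordering
# (assumption: the host uses the standard nuclear code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def is_silent(original, variant):
    """True if the variant encodes the same amino acid sequence as the original."""
    return len(original) == len(variant) and translate(original) == translate(variant)


# Toy example: AAA -> AAG keeps lysine (silent); AAA -> GAA gives glutamate.
print(is_silent("ATGAAA", "ATGAAG"))  # silent substitution
print(is_silent("ATGAAA", "ATGGAA"))  # changes the amino acid sequence
```

A check of this kind only confirms that a codon-usage adaptation is silent; it says nothing about which codons a given host prefers.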

When carrying out nucleotide substitutions, amino acid residue
changes are preferably of a minor nature, that is, conservative
amino acid residue substitutions that do not significantly
affect the folding or activity of the protein; small deletions,
typically of one to about 30 amino acid residues; or small amino-
or carboxyl-terminal extensions.

Examples of conservative substitutions are within the group of
basic amino acids (such as arginine, lysine, histidine), acidic
amino acids (such as glutamic acid and aspartic acid), polar
amino acids (such as glutamine and asparagine), hydrophobic
amino acids (such as leucine, isoleucine, valine), aromatic
amino acids (such as phenylalanine, tryptophan, tyrosine) and
small amino acids (such as glycine, alanine, serine, threonine,
methionine). For a general description of nucleotide
substitution, see e.g. Ford et al. (1991), Protein Expression
and Purification 2, 95-107.
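
The groups listed above can be written down as a small lookup, which makes it easy to test whether a proposed residue change stays within one group. This is only an illustrative sketch: the group membership is copied from the text, while the names `CONSERVATIVE_GROUPS` and `is_conservative` are hypothetical.

```python
# Residue groups as listed in the description (one-letter codes).
CONSERVATIVE_GROUPS = {
    "basic": set("RKH"),        # arginine, lysine, histidine
    "acidic": set("ED"),        # glutamic acid, aspartic acid
    "polar": set("QN"),         # glutamine, asparagine
    "hydrophobic": set("LIV"),  # leucine, isoleucine, valine
    "aromatic": set("FWY"),     # phenylalanine, tryptophan, tyrosine
    "small": set("GASTM"),      # glycine, alanine, serine, threonine, methionine
}


def is_conservative(original, replacement):
    """True if two different residues belong to the same listed group."""
    if original == replacement:
        return False  # no substitution at all
    return any(
        original in group and replacement in group
        for group in CONSERVATIVE_GROUPS.values()
    )


print(is_conservative("R", "K"))  # both basic
print(is_conservative("L", "D"))  # hydrophobic versus acidic
```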

It will be apparent to persons skilled in the art that such
substitutions can be made outside the regions critical to the
function of the molecule and still result in an active
transcription factor. Amino acid residues essential to the
activity of the transcription factor encoded by a DNA construct
of the invention, and therefore preferably not subject to
substitution, may be identified according to procedures known in
the art, such as site-directed mutagenesis or alanine-scanning
mutagenesis (cf. e.g. Cunningham and Wells, (1989), Science 244
1081-1085). In the latter technique mutations are introduced at
every residue in the molecule, and the resultant mutant
molecules are tested for biological activity (i.e. transcription
factor activity regulating an α-amylase promoter) to identify amino acid
residues that are critical to the activity of the molecule.
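
As a rough illustration of the alanine-scanning design described above, the following sketch enumerates the single-residue alanine variants of a protein sequence. The function name `alanine_scan`, the mutation labels and the toy peptide are assumptions made for the example; the activity assay itself is a laboratory step and is not modelled.

```python
def alanine_scan(protein):
    """List (label, sequence) pairs, one per non-alanine residue replaced by alanine.

    Labels use the common <original residue><1-based position>A notation.
    """
    variants = []
    for position, residue in enumerate(protein, start=1):
        if residue == "A":
            continue  # already alanine, nothing to mutate
        mutated = protein[:position - 1] + "A" + protein[position:]
        variants.append((f"{residue}{position}A", mutated))
    return variants


# Toy peptide, not taken from SEQ ID NO: 3.
for label, sequence in alanine_scan("MKA"):
    print(label, sequence)
```

Each variant would then be tested for its ability to regulate an α-amylase promoter; residues whose replacement abolishes that activity are candidates for being critical.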

The homology referred to in (i) above is determined as the
degree of identity between the two sequences indicating a
derivation of the one sequence from the other. The homology may
suitably be determined by means of computer programs known in
the art such as GAP provided in the GCG program package
(Needleman, S.B. and Wunsch, C.D., (1970), Journal of Molecular
Biology 48 443-453). Using GAP with the following settings for
DNA sequence comparison: GAP creation penalty of 5.0 and GAP
extension penalty of 0.3, the coding region of the DNA sequence
exhibits a degree of identity preferably of at least 60%, more
preferably at least 70%, more preferably at least 80%, more
preferably at least 90%, more preferably at least 95% with the
transcription factor encoding part of the DNA sequence shown in
SEQ ID NO: 1 and SEQ ID NO: 2.
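
To make the notion of "degree of identity" under a gap creation and a gap extension penalty more concrete, here is a minimal sketch of a Needleman-Wunsch style global alignment with affine gap penalties. It is not the GCG GAP program: the match and mismatch scores, the way a gap of length k is charged (`gap_open + k * gap_ext`), and the definition of identity as matching columns divided by all alignment columns are assumptions made for this example. Only the default penalties of 5.0 and 0.3 are taken from the text.

```python
def percent_identity(a, b, match=1.0, mismatch=-1.0, gap_open=5.0, gap_ext=0.3):
    """Globally align a and b with affine gaps; return % identical columns."""
    neg_inf = float("-inf")
    n, m = len(a), len(b)
    # Three score tables indexed by state:
    # 0 = residues aligned, 1 = residue of a opposite a gap, 2 = residue of b opposite a gap.
    score = [[[neg_inf] * (m + 1) for _ in range(n + 1)] for _ in range(3)]
    score[0][0][0] = 0.0
    for i in range(1, n + 1):
        score[1][i][0] = -(gap_open + gap_ext * i)
    for j in range(1, m + 1):
        score[2][0][j] = -(gap_open + gap_ext * j)

    def gap_candidates(state, i, j):
        # Score of extending (same state) or opening (other state) a gap from cell (i, j).
        return [
            score[k][i][j] - (gap_ext if k == state else gap_open + gap_ext)
            for k in range(3)
        ]

    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            score[0][i][j] = max(score[k][i - 1][j - 1] for k in range(3)) + s
            score[1][i][j] = max(gap_candidates(1, i - 1, j))
            score[2][i][j] = max(gap_candidates(2, i, j - 1))

    # Trace back from the best final state, counting columns and identities.
    i, j = n, m
    state = max(range(3), key=lambda k: score[k][i][j])
    matches = columns = 0
    while i > 0 or j > 0:
        columns += 1
        if state == 0:
            matches += a[i - 1] == b[j - 1]
            i, j = i - 1, j - 1
            state = max(range(3), key=lambda k: score[k][i][j])
        elif state == 1:
            i -= 1
            candidates = gap_candidates(1, i, j)
            state = max(range(3), key=lambda k: candidates[k])
        else:
            j -= 1
            candidates = gap_candidates(2, i, j)
            state = max(range(3), key=lambda k: candidates[k])
    return 100.0 * matches / columns if columns else 0.0


# Toy sequences: one mismatch in four aligned positions.
print(percent_identity("ACGT", "AGGT"))
```

For protein comparisons the same function could be called with `gap_open=3.0` and `gap_ext=0.1`, although a real protein comparison would normally use a substitution matrix rather than a flat match/mismatch score.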

The hybridization referred to in (ii) above is intended to
indicate that the analogous DNA sequence hybridizes to the same
probe as the DNA sequence encoding the transcription factor
under certain specified conditions, which are described in
detail in the Materials and Methods section hereinafter. The
oligonucleotide probe to be used is the DNA sequence
corresponding to the transcription factor encoding part of the DNA
sequence shown in SEQ ID NO: 1 or SEQ ID NO: 2 or a fragment
thereof.

The homology referred to in (iii) above is determined as the
degree of identity between the two sequences indicating a
derivation of the first sequence from the second. The homology
may suitably be determined by means of computer programs known
in the art such as GAP provided in the GCG program package
(Needleman, S.B. and Wunsch, C.D., supra). Using GAP with the
following settings for transcription factor sequence comparison:
GAP creation penalty of 3.0 and GAP extension penalty of 0.1,
the transcription factor encoded by an analogous DNA sequence
exhibits a degree of identity preferably of at least 50%, more
preferably at least 60%, more preferably at least 70%, even more
preferably at least 80%, especially at least 90% with the
transcription factor encoded by a DNA construct comprising the
transcription factor encoding part of the DNA sequence shown in
SEQ ID NO: 2, e.g. with the amino acid sequence SEQ ID NO: 3.

In connection with property (iv) the immunological reactivity
may be determined by the method described in the Materials and
Methods section hereinafter.

In relation to the property (v) the complementation method is
described in Example 1 herein.

The DNA sequence encoding a transcription factor of the
invention can be isolated from the strain Aspergillus oryzae IFO
4177 using standard methods, e.g. as described by Sambrook et
al. (1989) Molecular Cloning: A Laboratory Manual. Cold Spring
Harbor Lab.; Cold Spring Harbor, NY.

General RNA and DNA isolation methods are also disclosed in WO
93/11249 and WO 94/14953, the contents of which are hereby
incorporated by reference. A more detailed description of the
complementation method is given in Example 1 herein.

Alternatively, the DNA encoding a transcription factor of the
invention may, in accordance with well-known procedures, be
conveniently isolated from a suitable source, such as any of the
below mentioned organisms, by use of synthetic oligonucleotide
probes prepared on the basis of a DNA sequence disclosed herein.
For instance, a suitable oligonucleotide probe may be prepared
on the basis of the transcription factor encoding part of the
nucleotide sequences presented as SEQ ID NO: 1 or any suitable
subsequence thereof, or on the basis of the amino acid sequence
SEQ ID NO: 3.

The invention relates specifically to a transcription factor
regulating the expression of the α-amylase promoter in a
filamentous fungus, which factor as indicated in Example 2 may
even regulate the expression of other genes.

In this context the expression "filamentous fungus" is intended
to include the groups Phycomycetes, Zygomycetes, Ascomycetes,
Basidiomycetes and Fungi imperfecti, including Hyphomycetes such
as the genera Aspergillus, Penicillium, Trichoderma, Fusarium
and Humicola.

In this context the expression "α-amylase promoter" means a
sequence of bases immediately upstream from an α-amylase gene
which RNA polymerase recognises and binds to, thereby promoting
transcription of the gene coding for the α-amylase.

As indicated, transcription factors are known from many
organisms and it is therefore expected that similar or
corresponding factors may be found originating from other fungi of
the genera Aspergillus, Trichoderma, Penicillium, Fusarium,
Humicola, etc., having an enhancing effect on the expression of
a polypeptide being under the regulation of amylase promoters in
any fungus belonging to any of these genera.

A comparison of the DNA sequence coding for the transcription
factor regulating the α-amylase promoter has shown some degree
of homology to a transcription factor (CASUC1) regulating the
expression of glucosidase in Candida and to MAL63 of
Saccharomyces cerevisiae as disclosed in Kelly and Kwon-Chung,
(1992) J. Bacteriol. 174 222-232.

It is at present contemplated that a DNA sequence encoding a
transcription factor homologous to the transcription factor of
the invention, i.e. an analogous DNA sequence, may be obtained
from other microorganisms. For instance, the DNA sequence may be
derived by a similar screening of a cDNA library of another
microorganism, such as a strain of Aspergillus, Saccharomyces,
Erwinia, Fusarium or Trichoderma.

An isolate of a strain of A. oryzae in which the gene coding
for a transcription factor of the invention has been inactivated
has been deposited by the inventors according to the Budapest
Treaty on the International Recognition of the Deposit of
Microorganisms for the Purposes of Patent Procedure at the DSM,
Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH,
Mascheroder Weg 1b, D-38124 Braunschweig, DEUTSCHLAND.

Deposit date: 6 MAY 1996 (06.05.96)

Depositor's ref.: ToC879 = NN049238

DSM designation: Aspergillus oryzae DSM No. 10671

The deposited strain Aspergillus oryzae DSM No. 10671 can be used
to isolate a transcription factor according to the invention
from any strain of Aspergillus oryzae and any other fungal
strain having such a gene by complementation as described
hereinafter.

The expression plasmid pToC320 comprising the full length
genomic DNA sequence encoding the transcription factor of the
invention has been transformed into a strain of E. coli
resulting in the strain ToC1058, which has been deposited by the
inventors according to the Budapest Treaty on the International
Recognition of the Deposit of Microorganisms for the Purposes of
Patent Procedure at the DSMZ, Deutsche Sammlung von
Mikroorganismen und Zellkulturen GmbH, Mascheroder Weg 1b,
D-38124 Braunschweig, DEUTSCHLAND.

Deposit date: 6 MAY 1996 (06.05.96)

Depositor's ref.: ToC1058 = NN049237

DSM designation: E. coli DSM No. 10666

According to the invention, factors of this type originating
from the species A. oryzae, A. niger, A. awamori, etc.,
especially A. oryzae IFO 4177, are preferred.

The transcription factor of the invention has been found not
only to be involved in the regulation of the α-amylase promoter,
but also in the regulation of the glucoamylase promoter from A.
oryzae.

Especially, the invention comprises any factor having an amino
acid sequence comprising one or more fragments or combinations
of fragments of the amino acid sequence depicted as SEQ ID NO:
3.

Truncated forms of the transcription factor may also be active.
By truncated forms are meant modifications of the transcription
factor wherein N-terminal, C-terminal or one or more internal
fragments have been deleted.

A further aspect of the invention relates to a DNA sequence
coding for any of these factors.

In this aspect the invention especially comprises any DNA
sequence coding for one or more fragments of the amino acid
sequence depicted as SEQ ID NO: 3.

More specifically the invention relates to a DNA sequence
comprising one or more fragments or a combination of fragments
of the DNA sequence depicted as SEQ ID NO: 1 and SEQ ID NO: 2.

According to a further aspect the invention relates to a method
of producing a filamentous fungal host cell comprising the
introduction of any of the above mentioned DNA fragments into a
filamentous fungus wherein the α-amylase promoter or another
co-regulated promoter regulates the expression of a polypeptide of
interest in a manner whereby said factor will be expressed in
said fungus.

The introduction of said DNA fragment may be performed by means
of any well known standard method for the introduction of DNA
into a filamentous fungus, such as by use of an expression
vector and host cells as described below.

Therefore, the invention also provides a recombinant expression
vector comprising the DNA construct of the invention.

The expression vector of the invention may be any expression
vector that is conveniently subjected to recombinant DNA
procedures, and the choice of vector will often depend on the
host cell into which it is to be introduced.

Thus, the vector may be an autonomously replicating vector, i.e.
a vector which exists as an extrachromosomal entity, the
replication of which is independent of chromosomal replication,
e.g. a plasmid. Alternatively, the vector may be one which, when
introduced into a host cell, is integrated into the host cell
genome and replicated together with the chromosome(s) into which
it has been integrated.

In the expression vector, the DNA sequence encoding the
transcription factor should either also contain the expression
signal normally associated with the transcription factor or
should be operably connected to a suitable promoter and
terminator sequence. The promoter may be any DNA sequence which
shows transcriptional activity in the host cell of choice and
may be derived from genes that are either homologous or
heterologous to the host cell. The procedures used to ligate the
DNA sequences coding for the transcription factor, the promoter
and the terminator, respectively, and to insert them into
suitable vectors are well known to persons skilled in the art
(cf. Sambrook et al., supra).

Examples of suitable promoters for use in filamentous fungal
host cells are, for instance, the A. nidulans ADH3 promoter
(McKnight et al. (1985) The EMBO J. 4 2093-2099) or the tpiA
promoter. Examples of other useful promoters are those derived
from the gene encoding Aspergillus oryzae α-amylase, Aspergillus
niger neutral α-amylase, Aspergillus niger acid stable
α-amylase, Aspergillus niger, Aspergillus awamori, or Aspergillus
oryzae glucoamylase (gluA), A. oryzae alkaline protease (alp),
A. oryzae nitrate reductase (niaD), Aspergillus oryzae triose
phosphate isomerase (tpi), Aspergillus nidulans acetamidase, or
an Aspergillus promoter of an amino acid biosynthetic
gene such as argB.

In yet another aspect the invention provides a host cell
comprising the DNA construct of the invention and/or the
recombinant expression vector of the invention.

Preferably, the host cell of the invention is a eukaryotic cell,
in particular a fungal cell such as a yeast or filamentous
fungal cell. In particular, the cell may belong to a species of
Trichoderma, preferably Trichoderma harzianum or Trichoderma
reesei, or a species of Aspergillus, most preferably Aspergillus
oryzae or Aspergillus niger. Fungal cells may be transformed by
a process involving protoplast formation and transformation of
the protoplasts followed by regeneration of the cell wall in a
manner known per se. The use of Aspergillus as a host
microorganism is described in EP 238 023 (Novo Nordisk A/S), the contents
of which are hereby incorporated by reference. The host cell may
also be a yeast cell, e.g. a strain of Saccharomyces, in
particular Saccharomyces cerevisiae, Saccharomyces kluyveri or
Saccharomyces uvarum, a strain of Schizosaccharomyces sp., such
as Schizosaccharomyces pombe, a strain of Hansenula sp., Pichia
sp., Yarrowia sp., such as Yarrowia lipolytica, or Kluyveromyces
sp., such as Kluyveromyces lactis.

The endogenous amyR gene of the host cell may be deleted or
inactivated by other means. Placing amyR under the control of
a heterologous promoter will then lead to a completely new
scheme of regulation of the α-amylase promoter. If, for
example, amyR is fused to the A. oryzae niaD promoter, the
α-amylase promoter will become inducible by nitrate. If, instead
of the niaD promoter, a palC-regulated promoter is used, the
activity of the α-amylase promoter will be regulated by pH.

The invention also comprises a method of producing a polypeptide
of interest, whereby a host cell as described above is grown
under conditions conducive to the production of said factor and
said polypeptide of interest, and said polypeptide of interest
is recovered.

The medium used to culture the transformed host cells may be any
conventional medium suitable for growing the host cells in
question. The expressed polypeptide of interest may conveniently
be secreted into the culture medium and may be recovered
therefrom by well-known procedures including separating the cells
from the medium by centrifugation or filtration, precipitating
proteinaceous components of the medium by means of a salt such
as ammonium sulphate, followed by chromatographic procedures
such as ion exchange chromatography, affinity chromatography, or
the like.

According to the invention the method may be used to produce a
polypeptide of interest that is a medicinal polypeptide,
especially such medicinal polypeptides as growth hormone, insulin,
blood clotting factor, and the like.

The method of the invention may also be used for the production
of industrial enzymes, such as proteases, lipases, amylases,
glucoamylases, oxidoreductases, carbohydrases, carbonyl
hydrolases, cellulases, esterases, etc.

According to a further aspect of the invention said
transcription factor may be used for enhancing the expression of a
polypeptide of interest in a filamentous fungus, such as a
fungus of the genus Aspergillus, Trichoderma, Penicillium,
Fusarium, Humicola, etc., especially of the species A. oryzae,
A. niger, A. awamori, etc., and specifically A. oryzae.

35 

The transcription factor of the invention may thus be used to 
enhance the expression of a medicinal polypeptide, such as 
growth hormone, insulin, blood clotting factor, etc. 
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Also, the expression of industrial enzymes, such as proteases, 
lipases, amylases, glucoamylases, oxido reductases, 
carbohydrases, carbonyl hydrolases, cellulases, esterases, etc., 
5 may be enhanced by the use of the transcription factor of the 
invention. 

The transcription factor may also be used to identify the
sequences in the α-amylase promoter to which it binds. For
example, this could be done by making a GST-fusion protein with
the DNA binding domain of AmyR, such as the zinc finger, for
production in E. coli. Such fusion proteins may be conveniently
made using commercially available kits, for example, "The GST
Gene Fusion Kit" from Pharmacia. The purified GST-fusion
protein can then be used in conventional in vitro techniques
such as gel shift assays or DNA footprint analyses (Kulmburg,
P., et al. (1992) Molecular and Cellular Biology 12 1932-1939;
Lutfiyya, L.L., and Johnston, M. (1996) Molecular and Cellular
Biology 16 4790-4797). The identification of the AmyR binding
site will make it possible to insert these sequences in other
promoters not normally regulated by AmyR.

MATERIALS AND METHODS

Hybridization:

Suitable hybridization conditions for determining hybridization
between a nucleotide probe and an "analogous" DNA sequence of
the invention may be defined as described below. The
oligonucleotide probe to be used is the DNA sequence
corresponding to the transcription factor encoding part of the DNA
sequence shown in SEQ ID NO: 1, i.e. nucleotides 1691..2676 +
2743..3193 + 3278..3653 in SEQ ID NO: 1, or a fragment thereof,
e.g. nucleotides 1770-1800 in SEQ ID NO: 1.
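
The three coordinate ranges quoted above can be handled programmatically when assembling a probe sequence. The following is a minimal sketch, assuming the ranges are 1-based and inclusive (the usual convention in sequence listings); the function names and the toy input string are illustrative and are not part of the application.

```python
# Coding segments of amyR as quoted in the text (assumed 1-based, inclusive).
AMYR_SEGMENTS = [(1691, 2676), (2743, 3193), (3278, 3653)]


def segment_lengths(segments):
    """Return the length in bp of each (start, end) segment."""
    return [end - start + 1 for start, end in segments]


def gaps_between(segments):
    """Return the lengths of the gaps between consecutive segments."""
    return [nxt[0] - cur[1] - 1 for cur, nxt in zip(segments, segments[1:])]


def splice(sequence, segments):
    """Join the listed segments of `sequence` into one contiguous string."""
    return "".join(sequence[start - 1:end] for start, end in segments)


if __name__ == "__main__":
    print(segment_lengths(AMYR_SEGMENTS))  # [986, 451, 376]
    print(gaps_between(AMYR_SEGMENTS))     # [66, 84]
```

Applied to the full SEQ ID NO: 1 string, `splice` would return the joined coding segments; the gaps correspond to the regions between the listed ranges.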

Hybridization conditions

Suitable conditions for determining hybridization between a
nucleotide probe and a homologous DNA or RNA sequence involve
presoaking of the filter containing the DNA fragments or RNA to
hybridize in 5x SSC (standard saline citrate buffer) for 10 min,
and prehybridization of the filter in a solution of 5x SSC
(Sambrook, et al., supra), 5x Denhardt's solution (Sambrook, et
al., supra), 0.5 % SDS and 100 µg/ml of denatured sonicated
salmon sperm DNA (Sambrook, et al., supra), followed by
hybridization in the same solution containing a random-primed
(Feinberg, A. P. and Vogelstein, B. (1983) Anal. Biochem. 132 6-13),
32P-dATP-labeled (specific activity > 1 x 10^ cpm/µg) probe for
12 hours at ca. 65°C. The filter is then washed two times for 30
minutes in 2x SSC, 0.5 % SDS at preferably not higher than 50°C,
more preferably not higher than 55°C, more preferably not higher
than 60°C, more preferably not higher than 65°C, even more
preferably not higher than 70°C, especially not higher than
75°C.
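
Preparing the 2x SSC, 0.5 % SDS wash solution is a simple dilution from concentrated stocks. The sketch below assumes 20x SSC and 10 % SDS stock solutions; these stock strengths are common laboratory choices but are assumptions, as the text does not specify them, and the function name is hypothetical.

```python
def wash_buffer_volumes(final_ml, ssc_x=2.0, sds_pct=0.5,
                        ssc_stock_x=20.0, sds_stock_pct=10.0):
    """Volumes (ml) of each stock and of water for `final_ml` of wash buffer.

    Applies C1 * V1 = C2 * V2 to each component. The default stock
    strengths are assumptions, not values taken from the text.
    """
    ssc_ml = final_ml * ssc_x / ssc_stock_x
    sds_ml = final_ml * sds_pct / sds_stock_pct
    water_ml = final_ml - ssc_ml - sds_ml
    if water_ml < 0:
        raise ValueError("stocks are too dilute for the requested buffer")
    return {"ssc_stock_ml": ssc_ml, "sds_stock_ml": sds_ml, "water_ml": water_ml}


if __name__ == "__main__":
    # 500 ml of 2x SSC, 0.5 % SDS from the assumed stocks
    print(wash_buffer_volumes(500))
```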

Molecules to which the nucleotide probe hybridizes under these 
conditions are detected using a Phospho Image detector. 

Immunological cross-reactivity:

Antibodies to be used in determining immunological
cross-reactivity may be prepared by use of a purified transcription
factor. More specifically, antiserum against the transcription
factor of the invention may be raised by immunizing rabbits (or
rodents) according to the procedure described by N. Axelsen et
al. in: A Manual of Quantitative Immunoelectrophoresis,
Blackwell Scientific Publications, 1973, Chapter 23, or A.
Johnstone and R. Thorpe, Immunochemistry in Practice, Blackwell
Scientific Publications, 1982 (more specifically pp. 27-31).

Purified immunoglobulins may be obtained from the antisera, for
example by salt precipitation ((NH4)2SO4), followed by dialysis
and ion exchange chromatography, e.g. on DEAE-Sephadex.
Immunochemical characterization of proteins may be done either by
Ouchterlony double-diffusion analysis (O. Ouchterlony in:
Handbook of Experimental Immunology (D.M. Weir, ed.), Blackwell
Scientific Publications, 1967, pp. 655-706), by crossed
immunoelectrophoresis (N. Axelsen et al., supra, Chapters 3 and
4), or by rocket immunoelectrophoresis (N. Axelsen et al., op
cit., Chapter 2).

EXAMPLES 


EXAMPLE 1 

Cloning of the amyR transcription factor from A. oryzae 

amyR was cloned by complementation of an A. oryzae mutant strain
unable to express two different proteins both under control of
the TAKA-amylase promoter. The mutant A. oryzae strain ToC879
was made by mutagenesis of a strain, SRe440, containing a lipase
(HLL) encoding cDNA under control of the TAKA promoter and one
copy of the TAKA-amylase gene transcribed from its own promoter.

The mutant was identified and isolated by its amylase negative
(amylase⁻) phenotype and subsequently shown to be lipase
negative (lipase⁻) as well.

The strain ToC879 contains intact copies of both expression
cassettes. The amylase⁻ phenotype makes ToC879 unable to grow on
plates containing 1% cyclodextrin as the sole carbon source,
while the parent strain SRe440 will grow on such plates.

ToC879 has been deposited at DSM under the name DSM No. 10671.

amyR was isolated by co-transforming ToC879 with an A. oryzae
cosmid library and an autonomously replicating pHelp1 based
plasmid (D. Gems, I. L. Johnstone, and A. J. Clutterbuck (1991)
Gene 98 61-67) carrying the bar gene from Streptomyces
hygroscopicus which confers resistance to glufosinate. The
transformants were subjected to selection on plates containing
cyclodextrin as the sole carbon source and screened for a
concurrent reversion to the lipase⁺ phenotype.

The transforming DNA was rescued from colonies able to grow on
cyclodextrin. Subcloning resulted in the isolation of a 4.3 kb
DNA fragment able to complement both phenotypes of ToC879. The
gene harboured on this fragment was named amyR.




Construction of the pHelp1 derivative pMT1657

A plasmid, pMT1612, was made by ligation (and subsequent
transformation into E. coli DH5α) of the following four fragments:

i) the E. coli vector pToC65 (described in EP 531 372) cut with
SphI/XbaI,

ii) a PCR fragment (containing the A. nidulans amdS promoter)
cut with SphI/BamHI,

iii) a 0.5 kb BamHI/XhoI fragment from pBP1T (B. Straubinger et
al., (1992) Fungal Genetics Newsletter 39 82-83) containing the
bar gene, and

iv) a 0.7 kb XhoI/XbaI fragment from pIC AMG/Term (EP
Application No. 87103806.3) containing the A. niger glucoamylase
transcription terminator.

The PCR fragment containing the amdS promoter was made using the
plasmid pMSX-6B1 (M. E. Katz et al., (1990) Mol. Gen. Genet. 220
373-376) as substrate DNA and the two oligonucleotides 4650 (SEQ
ID NO: 4) and 4651 (SEQ ID NO: 5) as primers.

4650: CTTGCATGCCGCCAGGACCGAGCAAG, SEQ ID NO: 4

4651: CTTGGATCCTCTGTGTTAGCTTATAG, SEQ ID NO: 5

pMSX-6B1 contains an amdS promoter up mutation called I666.

pMT1612 was cut with HindIII, dephosphorylated and ligated to a
5.5 kb HindIII fragment from pHelp1 containing the AMA1
sequence. The resulting plasmid, pMT1657, is self-replicating in
Aspergilli and can be selected for by increased resistance to
glufosinate. pMT1657 is depicted in Fig. 1, wherein PamdS
represents the amdS promoter of fragment ii) above, bar
represents fragment iii) above, and Tamg represents fragment iv)
above.

Construction of the cosmid library 

A cosmid library of Aspergillus oryzae was constructed
essentially according to the instructions from the supplier of the
"SuperCos1 cosmid vector kit" (Stratagene Cloning Systems, La
Jolla CA, USA).

Genomic DNA of A. oryzae IFO4177 was prepared from protoplasts
made by standard procedures (Christensen, T., et al. (1988)
Biotechnology 6 1419-1422).

After isolation the protoplasts were pelleted by centrifugation
at 2500 rpm for 5 minutes in a Labofuge T (Heto); the pellet was
then suspended in 10 mM NaCl, 20 mM Tris-HCl (pH 8.0), 1 mM
EDTA, 100 µg/ml proteinase K and 0.5% SDS as stated in the
manual from the SuperCos1 cosmid vector kit; the rest of the
DNA preparation was done according to the instructions of the
kit.

The size of the genomic DNA was analysed by electrophoresis
using the CHEF-gel apparatus (Bio-Rad Laboratories, Hercules CA,
USA). A 1% agarose gel was run for 20 hours at 200 volts with a
10-50 second pulse. The gel was stained with ethidium bromide
and photographed. The DNA was 50 to >100 kb in size. The DNA was
partially digested using Sau3A. The size of the digested DNA was
20-50 kb, as determined by the same type of CHEF-gel analysis as
above. The CsCl gradient banded SuperCos1 vector was prepared
according to the manual. Ligation and packaging were likewise
performed as described in the manual.

After titration of the library, all of the packaging mix from
one ligation and packaging was transfected into the host cells,
XL1-Blue MR, and plated on 50 µg/ml ampicillin LB plates.
Approximately 3800 colonies were obtained. Cosmid preparations
from 10 colonies showed that they all had inserts of the
expected size. The colonies were picked individually and
inoculated in microtiter plate wells with 100 µl LB (100 µg/ml
ampicillin) and incubated at 37°C overnight. 100 µl of 50%
glycerol was added to each well, and the entire library was
frozen at -80°C. A total of 3822 colonies were stored.
This represents the A. oryzae genome approximately 4.4 times.
After picking the colonies the plates were scraped off, the
scrape-off pooled and the total library was also stored in four
pools as frozen glycerol stock. The four pools were named
ToC901-ToC904.
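
The stated 4.4-fold representation follows from the number of clones, the average insert size and the genome size. The text gives only the clone count (3822) and the size range of the digested DNA (20-50 kb); the 40 kb average insert and the 35 Mb genome size used below are illustrative assumptions chosen to be consistent with the stated coverage, not values from the application. The second function is the standard Clarke-Carbon estimate of the probability that a given locus is present in a library of N clones.

```python
def fold_coverage(n_clones, insert_bp, genome_bp):
    """Genome equivalents represented by the library."""
    return n_clones * insert_bp / genome_bp


def prob_locus_present(n_clones, insert_bp, genome_bp):
    """Clarke-Carbon estimate: P = 1 - (1 - insert/genome) ** N."""
    return 1.0 - (1.0 - insert_bp / genome_bp) ** n_clones


N_CLONES = 3822          # from the text
INSERT_BP = 40_000       # assumption: within the 20-50 kb range of the digest
GENOME_BP = 35_000_000   # assumption: approximate A. oryzae genome size

if __name__ == "__main__":
    print(round(fold_coverage(N_CLONES, INSERT_BP, GENOME_BP), 1))
    print(round(prob_locus_present(N_CLONES, INSERT_BP, GENOME_BP), 3))
```

Under these assumptions the coverage comes out close to the 4.4-fold figure given in the text, and a single-copy gene such as amyR would be expected to be present in the library with high probability.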

The individually frozen colonies in the library were inoculated
onto LB-plates (100 µg/ml ampicillin) by using a multipin device
of 6 rows of 8 pins fitting into half a microtiter dish. Plates
were made containing colonies from all clones in the library.
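
The replication step can be sized with simple arithmetic. The sketch below assumes standard 96-well microtiter plates (consistent with a 48-pin device covering half a dish, though the plate format is not stated in the text); the function name is hypothetical.

```python
import math

WELLS_PER_PLATE = 96   # assumption: standard 96-well microtiter format
PINS = 6 * 8           # multipin device described in the text


def plates_and_stampings(n_clones, wells=WELLS_PER_PLATE, pins=PINS):
    """Return (microtiter plates, multipin stampings) needed for n_clones."""
    plates = math.ceil(n_clones / wells)
    stampings_per_plate = math.ceil(wells / pins)
    return plates, plates * stampings_per_plate


if __name__ == "__main__":
    print(plates_and_stampings(3822))  # (40, 80)
```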

The plates were incubated at 37°C overnight. Sterilized Whatman
540 filters cut to the size of a petri dish were placed upon the
colonies which were incubated for two more hours at 37°C. The
filters were transferred to LB plates containing 200 µg/ml of
chloramphenicol and the plates were incubated overnight at 37°C.

The next day the filters were washed twice in 0.5 M NaOH for 5
minutes, then twice in 0.5 M Tris-HCl (pH 7.4) for 5 minutes and
then twice in 2x SSC for 5 minutes. The filters were wetted with
ethanol and air dried.

Selection of amyR clones

Cosmid DNA was prepared from ToC901-904 and introduced into
ToC879 by co-transformation with pMT1657. The transformation
procedure is described in EP Application No. 87103806.3.
Approximately 8700 transformants were selected by resistance to
1 mg/ml glufosinate in minimal plates (Cove D.J. (1966) BBA 113
51-56) containing 1 M sucrose for osmotic stabilization and 10
mM (NH4)2SO4.

Ten randomly chosen transformants were reisolated once on the
same type of plates. Conidiospores from these 10 transformants
were inoculated in minimal medium containing 1 mg/ml glufosinate
and grown at 30°C until enough mycelium for DNA preparation
could be harvested. DNA was prepared as described in T.
Christensen et al. (supra).

The uncut DNA was applied to a 0.7% agarose gel, and
electrophoresis was performed, followed by Southern blotting. The blot was
hybridized with a 32P-labelled SuperCos1 specific DNA fragment.
Each of the ten transformants showed a band with a higher
mobility than the linear chromosomal DNA. Each of the bands
also hybridized to a pHelp1 specific probe, indicating that the
co-transformation frequency of the cosmid library was close to
100% and that the cosmids had integrated into the autonomously
replicating vector pHelp1.

The transformants were unstable as expected for pHelp1
transformants. Less than 10% of the conidiospores from a glufosinate
resistant colony gave rise to glufosinate-resistant progeny.

Conidiospores from all the transformants were collected in 8
pools and plated on minimal plates (Cove D.J., supra) containing
1 mg/ml glufosinate, 10 mM (NH4)2SO4 and 1% β-cyclodextrin
(Kleptose from Roquette Frères, 62136 Lestrem, France).

Four colonies were obtained from one of the pools and one from
one of the other pools. Two of the colonies from the first pool
were reisolated once on the same kind of plates.

Conidiospores from the reisolated colonies were plated on
minimal plates with either glucose or cyclodextrin as a carbon
source and on glufosinate-containing plates. The glufosinate
resistance and the ability to grow on cyclodextrin were both
unstable phenotypes with the same degree of instability. This
indicated that the gene conferring the ability to grow on
cyclodextrin was physically linked to pMT1657 in the
transformants.

Colonies from the reisolation plates were cut out and were
analysed by rocket immunoelectrophoresis (RIE) using an
antibody raised against the HLL lipase. The transformants gave a
clear reaction with the antibody, while ToC879 colonies grown on
maltose gave no reaction. This led to the conclusion that both
the expression of amylase (i.e., growth on cyclodextrin) and
lipase (i.e. antibody binding) had been restored in these
transformants. The gene responsible for this phenotype was named
amyR.

Isolation of the amyR gene

In order to rescue the amyR gene from the amylase⁺, lipase⁺
transformants of ToC879, two different approaches were used
successfully.

DNA was prepared from mycelium grown in minimal medium with
cyclodextrin as the carbon source.

In the first approach the DNA was packaged into λ-heads using
the Gigapack® II kit from Stratagene in an attempt to rescue the
original cosmid out of the total DNA. The packaging reaction was
incubated with XL1-Blue MR E. coli under the conditions
specified by the kit supplier. The E. coli cells were plated on
LB plates with 50 µg/ml ampicillin. Two colonies appeared on the
plates; the cosmids they contained were identical and named
ToC1012.

In the second approach the total DNA was used in an attempt to
transform competent E. coli DH5α cells. Sixteen colonies were
isolated and shown to contain six different plasmids by
restriction enzyme digest. Each of the plasmids was digested
with EcoRI and subjected to Southern analysis. A 32P-labelled
probe of a mixture of pMT1657 and SuperCos1 was used to identify
DNA fragments not part of any of these vectors. Two EcoRI
fragments, approximately 0.7 and 1.2 kb in size, did not
hybridize to any of these probes. The 1.2 kb fragment was
isolated, labelled with 32P and used as a probe in a
hybridization experiment with the filters containing the part of
the cosmid library that gave rise to the original transformants.
Six cosmids from the pool (ToC904), containing approximately
1000 clones, did hybridize.

Of these, some were shown by restriction enzyme digestion to be
identical, resulting in the isolation of four different cosmids.
All cosmids contained at least parts of the TAKA-amylase gene as
well. The four cosmids and the cosmid ToC1012 were transformed
into ToC879 by co-transformation with pMT1623, a pUC based
plasmid that carries the bar gene under the control of the A.
oryzae tpi promoter. Fifteen transformants from each
co-transformation were isolated by resistance to glufosinate and
tested for the ability to grow on cyclodextrin.

Eight transformants of ToC1012 and three transformants of one of
the other cosmids, 41B12, were able to grow. None of the
transformants of the other cosmids grew. That not all of the
transformants of ToC1012 and 41B12 were able to grow is likely
to be a reflection of the co-transformation frequency in each
experiment. Colonies from the transformants growing on
cyclodextrin were analysed by RIE, which showed that they all
produced lipase.

DNA fragments obtained by digesting 41B12 with either BglII,
HindIII or PstI were cloned into pUC19 in order to subclone amyR
from the cosmid. The subclones were transformed into ToC879 and
the transformants analysed for the ability to grow on
cyclodextrin and produce lipase as described above. As depicted in Fig.
2, one plasmid called pToC316 was shown to contain an
approximately 9 kb HindIII fragment which was identified as
containing amyR.

Further subcloning resulted in a plasmid called pToC320
containing a 4.3 kb HindIII/SacI fragment, which is shown in
Fig. 3 and was subsequently sequenced on an ABI DNA sequencer
using both further subcloning and primer walking.

A DNA sequence of 3980 bp including the amyR gene is shown in
SEQ ID NO: 1. The deduced amino acid sequence is shown in SEQ
ID NO: 3 and reveals a Gal4-type zinc finger sequence between
amino acids 28-54. Such sequences are known to bind to DNA
(Reece, R.J., and Ptashne, M. (1993) Science 261 909-910).

amyR maps close to one of the three amylase genes in IFO4177,
since it was isolated from a cosmid also containing
amylase-specific DNA fragments. Mapping of the cosmid showed that the
α-amylase gene and amyR are 5-6 kb apart. Southern analysis of
genomic DNA showed that only one copy of amyR is present in
IFO4177, and confirmed that it maps close to one of the amylase
genes.

Analysis of amyR cDNA 

mRNA was made by the method of Wahleithner, J. A., et al. (1996,
Curr. Genet. 29 395-403) from a culture of A. oryzae grown in
maltose containing medium under conditions favorable for
α-amylase production. Double stranded cDNA was made by standard
procedures and used for PCR reactions with the following
primers:

oligodT primer: TTTTGTAAGCT31 SEQ ID NO. 9

23087: CCCCAAGCTTCGCCGTCTGCGCTGCTGCCG SEQ ID NO. 6

20865: CGGAATTCATCAACCTCATCAACGTCTTC SEQ ID NO. 7

20866: CGGAATTCATCGGCGAGATAGTATCCTAT SEQ ID NO. 8

A PCR reaction with the primers 20866 and 23087 resulted in a
fragment of approximately 1.1 kb. The fragment was digested with
EcoRI and HindIII (these restriction sites were incorporated
into the primers) and cloned into a pUC19 vector cut with the
same enzymes.
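
The presence of the cloning sites in the primers can be checked by searching each primer for the enzymes' recognition sequences. The following is a minimal sketch: the primer sequences are those listed above, the recognition sequences (EcoRI GAATTC, HindIII AAGCTT) are the standard ones, and the helper name `sites_in` is hypothetical.

```python
RECOGNITION_SITES = {"EcoRI": "GAATTC", "HindIII": "AAGCTT"}

PRIMERS = {
    "23087": "CCCCAAGCTTCGCCGTCTGCGCTGCTGCCG",
    "20865": "CGGAATTCATCAACCTCATCAACGTCTTC",
    "20866": "CGGAATTCATCGGCGAGATAGTATCCTAT",
}


def sites_in(primer):
    """Map enzyme name -> 0-based position of its site in `primer`, if present."""
    found = {}
    for enzyme, site in RECOGNITION_SITES.items():
        pos = primer.find(site)
        if pos != -1:
            found[enzyme] = pos
    return found


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, sites_in(seq))
```

Primer 23087 carries the HindIII site and primers 20865 and 20866 carry the EcoRI site, each a few bases in from the 5' end, which is consistent with the EcoRI/HindIII digestion described.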

The insert in the resulting plasmid was sequenced; the result
located one intron in this part of the gene. The intron is
indicated in SEQ ID NO: 2.

Another PCR reaction with the oligodT primer and primer 20866
did not result in a distinct fragment. An aliquot of this
reaction was used as the starting point for a new reaction with
the oligodT primer and the primer 20865, which resulted in a
fragment of approximately 1.1 kb. This fragment was digested
with EcoRI and HindIII and cloned into pUC19.

Sequencing showed that the fragment contained the 3' part of
amyR and another intron was located. This is also indicated in
SEQ ID NO: 2. Three independent plasmids were sequenced at the
3' end and two polyA addition sites were located, one at bp no.
3827 and one at bp no. 3927.
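
The distance from the end of the coding region to each polyA addition site can be worked out from the coordinates in SEQ ID NO: 1. This is a rough sketch, assuming that nucleotide 3653 (the end of the last coding segment quoted in the Hybridization section) is the last coding nucleotide; whether that position includes the stop codon is not stated, so the lengths are approximate.

```python
CDS_END = 3653                 # end of the last coding segment quoted in the text
POLYA_SITES = (3827, 3927)     # polyA addition sites reported above


def utr_lengths(cds_end, polya_sites):
    """Distance in bp from the end of the coding region to each polyA site."""
    return [site - cds_end for site in polya_sites]


if __name__ == "__main__":
    print(utr_lengths(CDS_END, POLYA_SITES))  # [174, 274]
```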

EXAMPLE 2 

Quantification of glucoamylase synthesis in an amyR⁻ strain

A. oryzae produces a glucoamylase, encoded by the glaA gene,
which is regulated by the same substances as α-amylase (Y. Hata
et al. (1992) Curr. Genet. 22 85-91). In order to see whether
amyR is also involved in regulation of glaA the synthesis of
glucoamylase was measured under inducing conditions in the amyR⁻
strain ToC879 and in the amyR wt strain SRe440, from which
ToC879 was directly derived.

Conidiospores from each strain were inoculated in 10 ml YPM (YP
containing 2% maltose) and grown for four days at 30°C.
Supernatants were collected and analysed for glucoamylase content by
incubation with p-nitrophenyl α-D-glucopyranoside, a substrate
that turns yellow when cleaved by glucoamylase. In the procedure
used, 0.5 ml of fermentation broth was mixed with 1 ml of 0.1 M
Na-acetate pH = 4.3, containing 1 mg/ml of the substrate. The
samples were incubated for 3 hours at room temperature and 1.5
ml of 0.1 M Na2B4O7 was added. The yellow colour was measured in
a spectrophotometer at 400 nm. Control samples were made by
mixing the supernatants first with the borate and then with the
substrate solution. The results were:

Strain    Reaction minus control (OD units)
SRe440    0.655
ToC879    0.000

The absence of any OD reading in the sample taken from ToC879
clearly indicates that synthesis of glucoamylase of A. oryzae
requires the expression of the AmyR transcription factor.

EXAMPLE 3

Overexpression of AmyR 

A plasmid, pToC342, containing the coding region and 3'
noncoding sequences of amyR fused to the promoter for the A.
oryzae tpi gene was constructed. The tpi gene codes for
triosephosphate isomerase, a constitutively expressed enzyme
involved in primary metabolism. The A. oryzae tpi gene was
isolated by crosshybridization with an A. nidulans cDNA clone
according to the procedure of McKnight, G.L., et al. (1986, Cell
46 143-147). Sequencing led to identification of the structural
gene. The promoter used was a fragment of approximately 700 bp
immediately upstream of the coding region. pToC342 was able to
complement the mutation in ToC879. To pToC342 was further added
the A. oryzae pyrG gene and the resulting plasmid, pToC359, was
transformed into JaL250, a pyrG mutant of JaL228 described in
patent application DK1024/96 filed 1996-09-19. Strains
containing multiple copies of pToC359 were found to synthesise
increased levels of glucoamylase.

Construction of pToC342 and pToC359

A PCR reaction was made with pToC320 as the template and the
following primers:

8753 GTTTCGAGTATGTGGATTCC

8997 CGGAATTCGGATCCGAGCATGTCTCATTCTC

The resulting fragment was cut with EcoRI/ApaI to produce a
fragment of approximately 180 bp which was then cloned into
pToC320 that had been digested with EcoRI/ApaI. The resulting
plasmid, pToC336, was sequenced to confirm that the PCR fragment
was intact. The 2.6 kb BamHI/SacI fragment of pToC336 containing
the coding region and the 3' untranslated sequence of amyR and
an EcoRI/BamHI fragment of approximately 700 bp containing the
tpi promoter were cloned into EcoRI/SacI digested pUC19. The
BamHI site downstream of the tpi promoter was introduced in
vitro, whereas the EcoRI site is an endogenous site from the
original tpi clone. The resulting plasmid, called pToC342, was
cut with HindIII, dephosphorylated and ligated to a 1.8 kb
HindIII fragment containing the A. oryzae pyrG gene, resulting
in a plasmid which was called pToC359. The structures of both
pToC342 and pToC359 are shown in Fig. 4, wherein Ptpi represents
the tpi promoter and TamyR represents the 3' noncoding region of
amyR. The cloning of the pyrG gene has been previously
described in WO 95/35385.

Expression in A. oryzae JaL250 

JaL250 is a pyrG mutant of JaL228 selected by resistance to
5-fluoro-orotic acid. JaL228 has been described in patent
application DK1024/96 filed 1996-09-19. JaL250 was transformed
with pToC359 using standard procedures and by selecting for
relief of uridine requirement. The transformants were reisolated
twice through conidiospores and grown for four days in YP + 2%
maltose at 30°C. Secreted glucoamylase was measured by the
ability to cleave p-nitrophenyl α-D-glucopyranoside. The
transformants had 5-31 arbitrary glucoamylase units/ml in the
fermentation broth, while JaL228 had 2-3 units/ml. The best
transformant was named ToC1200. Southern analysis showed that
multiple copies of pToC359 had integrated into the genome of
ToC1200. Because of the α-amylase promoter, ToC1200 may be used
advantageously as a host strain for expression plasmids.
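
The reported unit ranges can be compared as fold changes. The sketch below uses only the ranges quoted above (5-31 units/ml for the transformants, 2-3 units/ml for JaL228); because only ranges are given, the result is itself a range, and the function name is illustrative.

```python
def fold_change_range(sample_range, reference_range):
    """Lowest and highest possible fold change given two (min, max) ranges."""
    s_min, s_max = sample_range
    r_min, r_max = reference_range
    return s_min / r_max, s_max / r_min


TRANSFORMANTS = (5.0, 31.0)  # units/ml, from the text
JAL228 = (2.0, 3.0)          # units/ml, from the text

low, high = fold_change_range(TRANSFORMANTS, JAL228)

if __name__ == "__main__":
    print(f"{low:.1f}- to {high:.1f}-fold")  # 1.7- to 15.5-fold
```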

EXAMPLE 4

Carbon catabolite repression of the TAKA-promoter

The TAKA-amylase promoter is subject to carbon catabolite
repression. In Aspergilli carbon catabolite repression is at
least partially mediated via the transcriptional repressor CreA,
a homologue to S. cerevisiae MIG1. The DNA binding sites in
promoters under CreA control are known to be GC-rich and
seemingly identical to the MIG1 sites in S. cerevisiae. The
TAKA-amylase promoter contains several potential CreA binding
sites. To determine whether this promoter is involved in carbon
catabolite repression, three such sites were mutated, but the
mutations provided only partial relief of carbon catabolite
repression. In contrast, introduction of copies of constitutively
expressed AmyR in strains containing the modified promoter coupled
to a reporter gene completely relieved repression of the reporter.

Construction of a CreA site deleted TAKA-amylase promoter

Three sites were identified as being potential CreA binding
sites in the TAKA-amylase promoter by sequence comparison to
known CreA and MIG1 sites. The resulting sites have the
following sequences:

Site I C CCCG GTATTG
Site II CC£S:SGAGTCA 
Site III ATATGG£aSGT 

The bases underlined were changed to A's because such changes
are known to destroy MIG1 binding sites. The substitutions were
made using standard site-specific mutagenesis procedures. An
expression vector, pToC297, containing the modified promoter and
the 3' nontranscribed sequence of the glucoamylase gene from A.
niger was constructed. pToC297 is identical to pToC68 described
in WO 91/17243 except for the changes in the promoter. Both
plasmids have a unique BamHI site between the promoter and the
terminator.

Expression of a lipase regulated by a CreA⁻ TAKA-amylase
promoter

A BamHI fragment of approximately 950 bp containing the cDNA
encoding a Humicola lanuginosa lipase was cloned into pToC297.
(The cloning and expression of the H. lanuginosa lipase has been
previously described in EP 305 216.) The resulting plasmid,
pToC298, was transformed into A. oryzae IFO4177 by
co-transformation with the A. nidulans amdS gene, and its structure
is shown in Fig. 5, wherein Ptaka-creA represents the CreA
binding site deficient TAKA-amylase promoter. The transformants
were reisolated twice through conidiospores and one such
transformant, ToC1075, which produces lipase, was chosen for
further evaluation. ToC1075 and a p960 transformant of IFO4177
(previously described in EP 305 216) containing the lipase fused
to the wild type TAKA-promoter were grown at 30°C in 10 ml YP
containing 2% or 10% glucose. Samples were taken daily for
analysis of lipase in the fermentation broth. The lipase content
was measured by rocket immunoelectrophoresis using a polyclonal
antibody raised against purified lipase. Spent fermentation
broth from A. oryzae IFO4177 did not react with the antibody.
The glucose content of the fermentation broth was likewise
measured daily using Tes-tape from Lilly.

On day one, glucose was detected in all cultures, but on day two
glucose could be detected only in cultures originally containing
10%. The results of lipase production, shown in Fig. 6,
indicate that the wild type promoter is repressed until glucose
is no longer present. Thus, when the glucose becomes exhausted,
lipase begins to accumulate. Fig. 6 also shows that the modified
promoter is not as tightly regulated, as low levels of lipase
are produced in the presence of glucose in the 10% glucose
fermentation. Thus, there is partial glucose derepression seen
in ToC1075.

Relief of carbon catabolite repression of lipase in ToC1075 by
pToC342

ToC1075 was transformed with pToC342 by co-transformation with
the bar-containing plasmid, pMT1623. Strains containing multiple
copies of pToC342 and which retained the lipase expression
cassette were identified by Southern blot analysis; one such
strain was ToC1139. ToC1075 and ToC1139 were grown at 30°C in 10 ml YP
containing either 2% or 10% glucose, and samples were assayed
daily for lipase and glucose. The lipase was measured by
cleavage of para-nitrophenyl-butyrate. The glucose content was
measured with Tes-tape from Lilly. The results, shown in Fig.
7, indicate that ToC1075, as before, provides partial relief of
glucose repression while lipase production by ToC1139 is
independent of the presence of glucose.

EXAMPLE 5

Southern analysis of A. niger for the amyR gene

The syntheses of α-amylase and glucoamylase in A. niger, as in
A. oryzae, are regulated by the carbon source. It is therefore
likely that A. niger also contains an amyR gene. This
hypothesis was tested by looking for cross-hybridization
between the A. oryzae amyR gene and A. niger chromosomal DNA.

DNA was prepared from A. niger by conventional methods. The
DNA was cut with BamHI, BglII, EcoRI, HindIII, SalI, XmaI or
XbaI, and the resulting DNA fragments were separated by
electrophoresis on an agarose gel. The DNA was then blotted
onto a nitrocellulose membrane and hybridized with a
32P-labelled probe containing part of the structural A. oryzae amyR
gene. The probe was made by PCR on pToC320 and starts at bp
no. 1683 and ends at bp no. 2615 as shown in SEQ ID NO: 1. The
hybridization was conducted in 10x Denhardt's solution, 5x SSC,
10 mM EDTA, 1% SDS, 0.15 mg/ml polyA, 0.05 mg/ml yeast tRNA at
50°C overnight. After hybridization the membrane was washed
under conditions of increasing stringency and the radioactivity
on the membrane analysed by a PhosphorImager. Figure 8 shows the
result when the membrane had been washed in 2x SSC, 0.1% SDS at
58°C. Unique bands can be seen with several of the restriction
enzymes. Thus, the A. niger amyR gene can be cloned on the
basis of this cross-hybridization result.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(i) APPLICANT:
(A) NAME: Novo Nordisk A/S
(B) STREET: Novo Alle
(C) CITY: Bagsvaerd
(E) COUNTRY: Denmark
(F) POSTAL CODE (ZIP): DK-2880
(G) TELEPHONE: +45 4442 2668
(H) TELEFAX: +45 4442 6080

(ii) TITLE OF INVENTION: A transcription factor
(iii) NUMBER OF SEQUENCES: 9

(iv) COMPUTER READABLE FORM:
(A) MEDIUM TYPE: Floppy disk
(B) COMPUTER: IBM PC compatible
(C) OPERATING SYSTEM: PC-DOS/MS-DOS
(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)






(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3980 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus oryzae

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

TCEAGAOOGG (JCftlG"ltJC?lU (JTOCDOCAAG TTGATroCGG ACCXnanCJr A UTIULTILT 60 

TTIMGAAAC GGCMXrCTC TQCXXJina: CSWyOTGftAT TCHMCIfiGA TSDmaGTC 120 

50 TTOAOaAACC AGCTimUAC GGQCAAMO: CTCACAATIG ATOGOOOgTC OCXnTCDCKr 180 

OGMnaroc mrroooGrr gcmggcaaa acatoooogt CAAAantxG AGcaoGaarc 240 

OCiaC3«3Cr CrCGAOCSmS MMOOGAftG CS^MJTCaOGC TS^OTQCAAG GSDOMGrit: 300 

cmj3cr^ocPL iOGftOCTaQC ttcmcxijea axaGiaooG ACAGooGftT cAAGCJcrrrr seo 

GQCaAOnGA CCM^PCSPGHi CXI3OTMGC (XXXSPCTOSG AGAIGAfflCr CrOOOGftXa 420 

60 AACXSACGfiGA OOOGftAAftGA GIGIGTIGAT TCrroOGRGC MTTAO^GIA LTllXXJXOTC 480 

aCXSWOTGG AAAOnrCCT GAOCAATOCT GOOSATCKIC TSKEMOXT AQGCIGftlUU 540 

GIOCATOCOC aSAlAAKIGC OCEACMCaC GCnGAGOOC IGAAAAOSDV O ' lAUTltJlUJ 600 

ACSAGATOCAT TC?U3C3^3ACT CSyaSOOQC AAMSVCftTOS TTCOCCPXXy CAIMTDCAA 660 

GSroOdAAA OaXTOOQCT CJIGOCDGIGA GUGTmULA CQGCRaCICr ASTOGIGCXA 720 
TGftaSQGSAGC ATOOCjAmOC TIOCMTKIT

•mfiGoacsyvr Troacmacs tccrgcmtc 

5 MGCTA?G?V3 TSGAAUVTCG TCAlS^jADCTr 
ATOOCrrCA AIOXMTCA TicrciGnT 
CTTACATOOC G^CAIQCIGS A!DXAAGOCT 

10 

GCTRGGfnCA OGKIGRGCIG GCmCAAAC 
CnCriGAAT TCKTOCXriGA CIBGfiCTCTC 

15 ccraoacsGnT GcrcAAOGcnG mttotocac 

ATTDCJCIGCA TCITOCACXX: TCJOCTCAOiA 

GOCTOGKroG GCTTATAAiaS ACTXAACmT 

20 

AGTITOGAAC GACTTTGriTOC aSftTOOGrOC 

CrmGQQCA CWSACITytfVC CAAGAC3\AAG 

25 ATCnCITOC CMAGTAAOS OOLXiJnTiT 

crrrmTrc TrnrrrATr GnnxraGA 
CICRACXniG MGGocacac AGOcxxscnc 

30 

TCEACTIGCT ClUJUBOGAT aXATCATIT 

CTftTOCXSaGC MGICICMT CTOCJtfCDGA 

35 GTCAACXXra GAAAAGCXX3C CEAAACBGGC 

G ' iljTltrm SA GftGOTKXJa* GCX3ACafiGIG 

GG?U0OT3CrC a?roCX3\W3G QGaCTflGTT 

40 

TtXJCrOQCr ICAOGACXaC gicctctcac 

TIUULIA lTi lj GOJ KX XXm OdTClGOSOC 

45 TCCftGACrrc lOTGfiCTCUr TCaCTCGACr 

CTOGACaAAC laSCnaTTOG ACTOCTCXaC 

GTasAOXxzA AADcnrcraG amiGTCAA 

50 

QCmj l XJG'XG AGACRGGAOC AGCK3C3«3V 

CCAAOQCrAC GCTTTCATIG OOGCICIKIG 

55 OGCTTOCftGCA caOGCTrooaG PCOOOCCTVC 

CTIGroaQGA GAMAACTOC TOOOGAAGC 

aaCEAAMT AAOOOaAA AOCTOCXAAC 

60 

0CIW3ACRGA CP03KVCM3G OL'lULfl'itjm 

AdftOGCOA CAAaaOGAAT OCStfMACTC 

65 AfiGGftSAGIA TICP3QCICr TATTOGTCAC 

TCCXAAICaC CAOCStfUDu:: CMAAKmAC 

Aa3«WV0CA GTCMGCroC GCAACTOCAT 




(JUL?iUjTlUJ GAIOQACAAC AMCIOMA 780 

AAGTOOCXCT GQC3GADC3^ GGCICAGOCC 840 

TOOQCTCMG QGAGCTICQG ALWlbTlMG 900 

TGAcrraoQc ttoooswsag TGcnoccroc 960 

GracnaaQCT GmcrrrcA gaaCjIagcsc 1020 

CTQGAAI?^ CAT13\0Gr[GA GAC'lbTiVIA 1080 

CIOT^IAro ClOTGnOQCA COGOOaCJIOC 1140 

TacnnoGr posspcccpog g'iujl'jmag 1200 

CXnGATltXA CX3VITCnWV GOQQOCGrm 1260 

AAAD03Urr CIACAAC3QAA TOGOGAIGK? 1320 

T0CX3VmCT TSGC3AGIGAC GCIT^GQQGAC 1380 

AIGGAGmGA CTOCMffraG MTOKITOCA 1440 

TOCAOCTXA ancmTr Tncrmrr i500 

' iTivrmvr TrncrrcaA tccigaojit iseo 

(jL'iy^ooCTc Gcrrmajr aaoocancr i620 

CTAAWXCIC CACATOQGCG ;tf3A3MJIMC 1680 

CfilTCPCTCA AOOmaAAA MGAAA3QGA 1740 

CrOOaCAAT 'iUOUtflO ^A C GCAAAftia\A 1800 

CXaGOGnCTT CTTCICTOCT GmOCISOG 1860 

OOOCMGCTC TADXTCTCG CrCXI3V3tXA 1920 

CMOSMOOS ClUCXXXrWV TVCXXMOQQC 1980 

cnocsvcxrnA GooGAOOcrc agmctaca 2040 

AOCaOOOCXA GMCIOSTCT CCTCTCC03A 2100 

TOiaXXX3CA CIUIXJGSOSC CAUJLUJlC'r 2160 

' lU i UrilX r itJ AAGKIROCTSr TOCXEKICAT 2220 

GGACIGCCAC CAGCX33GAGC GCTTCncroC 2280 

OQaOGOCAaG C3CATOC3tfiC TCAM3CH3GA 2340 

OQ03CX3AGC3C MOCTOGAaS GACKTOCnRT 2400 

OJiGOGCPCA MAAMGAAT ACaAOSTOCT 2460 

clULTicm ' cim u^ o a s anjmaAA 2520 

ccuaxircpc acxsvogtoca TOGrrcrrcAC 258o 

GAAACIAAOC GIXXS^XSMG 2640 

AC3AAOC7IAA GAAAAGAAAA AAClUJjftCIT 2700 

AOSAAAAAOC MMGCTAOS CKTIACAACA 2760 

(XACftAACXA CMGICCIGr GCTCflGAOGA 2820 
CJOZAATOCIA GCXTEAOQQCjr TOOCAACCT CA30«CniC TTOGAAAAGC TCSGOXAAA 2880

TCTCmOSAC TOOCnCKXG (OTCmXaG CfiGCGCAGAC GCXXSROCXXr CX30CTftCITC 2940 

5 

TIOTmiaA rCCPGTCTCG CX3«GCAAAT CTOOCTOSftS OQCXnnCXrJ MKTCEAGRA 3000 

ACTRGACATC CiraiOVCTC AGCAAiaSCT flCAAACXMG ATCIOCSAAAC TCI UailSA C 3060 

10 cpccxxxxxjr ciaxxsoGA ctxxxjncrc oocrrcoyDc •nxxrorocr 3120 

pcnxxncPiPG GoosramaG gogicatogc asoaocMoc caMGraciG TiGAOcacrcA 31B0 

TOGmroaa aiojimgaa MosAocrm ccraacAov oocraacrca togicacic 3240 

15 

CCCKtCKTCr AEAOOOQCAA TCTOACAAAA AOOaCMGftA CStf^AAACTCT ACGACCTOK 3300 

C3UX'iaJL?lA GODGAaSTCr Cm aC TOOCT AnaaCAAAA GOOXXXftCC MCna3CXX3A 3360 

20 MOGAOMC GftCCnXCAG AACTOCTICIG GOOCATICIC ACAACXCTVT (XXXSAfftaX; 3420 

cojiiujuAA TCAiacmcr Taca>ao3cr csgtogaqcm A^iaaaQCA TCAiCRGrrr 3480 

OGAUiunui cnrocarcA CTcacmcr aoCTiajrrr GGnGOGOOx: oxxmrrAT 3540 

25 

GiaOaOGS^ GCTGAMCIG GCHTIGATIT OTTOGOGAIC GOaGRTCRTT TOCRWaGftG 3600 

QCaGAATGSG GCnX3CX3GfiQG QSKTIUIOJr GGLTGLXJUAG GAGA ll ' IUjr TTrSMOQCSG 3660 

30 crcnncrr TntrrriCT o jiuibTXt? ! ' GTiuxgiujr TciaacmjG oggogc i g'i a 3720 

TAmXXTIG AOGaiUlUCA TTOQCSKrroG GOTiacmcr QGrKEATOtr fllQGATIGIT 3780 

nrjimKEPC ICCCCIGCPG AOOGIQCS^AT GATOTOXXSI TCAftTCACrr CnaGGftCIC 3840 

35 

GGAGCACftGG GTGICaJTTC TOOOCTIMT CIGftl3B^3GA GATmiKEftG AA3X7CTTAA 3900 

TGKKMTAT TGTACSVmCX: TDtfWGAAAG ABOGCnQG CACXXXXSAXA TCnCAA33tf3A 3960 

40 AAAClt3G?ICT TCSncmGA 3980 

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 3980 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:
(A) ORGANISM: Aspergillus oryzae

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 1691..2676

(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 2677..2742

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 2743..3193

(ix) FEATURE:
(A) NAME/KEY: intron
(B) LOCATION: 3194..3277

(ix) FEATURE:
(A) NAME/KEY: exon
(B) LOCATION: 3278..3653

(ix) FEATURE:
(A) NAME/KEY: CDS
(B) LOCATION: join(1691..2676, 2743..3193, 3278..3653)
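The CDS feature above uses a `join(...)` location, meaning the coding sequence is obtained by concatenating the listed exon spans and skipping the introns between them. A minimal sketch of that splicing step follows, assuming 1-based inclusive coordinates; the function names `parse_join` and `splice` and the toy sequence are hypothetical and serve only as illustration.

```python
import re


def parse_join(location):
    """Parse 'join(a..b, c..d)' or 'a..b' into 1-based inclusive (start, end) pairs."""
    pairs = re.findall(r"(\d+)\s*\.\.\s*(\d+)", location)
    return [(int(start), int(end)) for start, end in pairs]


def splice(genomic, segments):
    """Concatenate the given 1-based inclusive segments of a genomic sequence."""
    return "".join(genomic[start - 1:end] for start, end in segments)


# The location string as printed for SEQ ID NO: 2.
cds_segments = parse_join("join(1691..2676, 2743..3193, 3278..3653)")

# Toy example (not from the application): exons in upper case, intron in lower case.
toy_genomic = "ccATGGCgtaagCTTAAtt"
toy_cds = splice(toy_genomic, parse_join("join(3..7, 13..17)"))
print(cds_segments, toy_cds)
```

The same two functions could be applied to the full 3980 bp sequence once a clean copy of SEQ ID NO: 2 is available.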
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TClMACaOG OCAICTCGTO (nOaSOCAflG Tlt yV lTCCTG AOCXnGTIGT AbTiULTlCT 60 
TTEAfiGftAZVC aCSCAOOOCrC TGOTTPCrOC GAACOaGAAT TSiaGCEW3A TCnMKTGnC 120 
20 TIGACGAACC PGJTCSTCCPC GQGC3\AAraC CrGRCaOTTG AlUiXXUTC OKriCCCAT 180 

OGKmaiGC TftDcroooGr GC3«acxaAA acmooocct CRAAajrocG MaaacaiTC 240 

CCIGCAKICr CrOGAOCAro AGAGGOGWC CAAGrrCAOSC TSCTiraftG GGnKraOJrC 300 

25 

CTAOaCAGCA ATGAQCTroaC TTCaOCasrA OGGRGrOaQG ACftGCATCAT CAAGCXnTTT 360 

CX3GAAanGA aSAAfiGRGriA OCXSCTTAftGC OSACEKIGaG AGKIGAATCr CIGCCGAGOV 420 

30 AflGGAOSBGA ca3GAAAftGA CTSTOnGfiT TCna3GM3C AGITOCaCnA CITOCXnCTC 480 

OGGAAATIGG AAAGCTTOCT GADCAATSCT GQCGATCSTC T3A33aaxrr POXZIGKCIG 540 

GTOCATOOOC GGA33^AATOC OOGACROGftC GCnGW30a: TCaAAMCTA GnMTrcrOG 600

AGftGftlCCAT TCAa3«3V:?r CAA3ACIGC3C AAAIACMa3 TTOOTACCr CAXKIT0C3\A 660

ocjroociAAA oouL' i txocTr GnacxDSiGA GGCjnTroov ooocATcrrcr pcvogugccpl 120 

TGAOQGGftGC AiUXaTOGC TTOGMnAlT (SXSXOJTTGG GMTX3ACAAC AfiGCTOCAAA 780 

TMGQQGAAT riUULTnOJ TOCfiGGAAIG AfiCTDOOOSr QCCTCPCCCC 840 

AGGCIAAGAG TOGAAmTCG TCAXftGROCT TOSGCTOaG aGAOSnOOG MCnGrrROG 900 

ATCCICTICA KTOCEATICA TiUiU ' lUTlT TGACETOOQC TT000GW3RG TGCrraOCrOC 960 

CTTftCKiooc cRcnrocraG MGOVftGOcr GTOOTADacr cjmumcA cM^Gmxto 1020 

50 GCmGCHTCA OSVTGAGCIG OCmCAAAC CTOGWOAftC CKTraCCHXSV GAdOTICIA 1080 

CnCnGftAT TGKKXCTGA CTAGBGlCrG CKnAADOG CICTGIOXA axaODOGfTOC 1140 

OCrOaQOGIT GCEAfiGOCIG AlTIA TSOtf: TCOHACRCT AITWVaXMG GIOGCEAIHG 1200 

ATiaxnGCA Tcnccaax: 'iuxtcacra cciGKmxA ocancnAA GCGcsaoGTrA 126O 

GCCIGGAIGG QCTAmAIGG AGnMCmT AAACAOGACT CIACAAa3AA TCraSATOIG 1320 

60 AGTITOGAAC GA bTiUl ' lA C OGAroOCnOC ICaCftrnCT TffiGGftCTGWC GCEAOaOGftC 1380 

dTBCQQCA CfiGACEAAAC CAAGACAAftS AT3C3RGI7«3A CTOCftflGI^G ATIAATroCA 1440 

AicrrcTiGC CAAAGTEAACG aJLjLuiTriT TOCACCTOCA GcacrmT TncrrrnT 1500 

crmTmc TrmnArr cttooccwsa TrxuiTritT TnTcncAA TCJcroAOJiT 1560 

CTCAACOCTG ATOOOSACAC AGOCnXTIC GCIAICOCIC GCnTIAajr aXXEAXICT 1620 
TCERGITOCT CiaSOOCraX GOaTCATrr CTAAMQCIC CPCKTCOXG AGAIWTCRTC 1680

cmrccGPCc otg tct cat ict cx3v pcc cac att ooc tok aca tdc gw\ 1729 

Met Ser His Ser Pro Thr Asp lie Pro Ser Thr Ser Glu 
15 10 



AflG GAA AIG GAG TCA AOC OCA GAA AM C30G OCT AAA CM GCC TGC GAC 1777 
Lys Glu Met Glu Ser Hir Pro Glu Lys Pto Pro Lys Gin Ala Cys Asp 
10 15 20 25 

AAT IGC CCT CJGA OOC AAA AIC AAG TCT TCT AGA GAG CTT CCA TGC GAC 1825 
Asn cys Arg Arg Arg Lys lie Lys Cys Ser Arg Glu Leu Pro cys Asp 
30 35 40 45 

15 

AAG IGC CPG ojr err err crcr toc tct toc tac agc gac gig ctc ggt i873 

Lys cys Gin Arg leu Leu Leu Ser Cys Ser Tyr Ser Asp Val Leu Arg 
50 55 60 

20 aC3C AAG GQC OOC AAG TTC OOC AOG dC ISC OCT CTC GOT OGC KTC CKT 1921 
Arg Lys Gly Pro Lys Phe Arg Ihr Leu Tyr Pro Leu Ala Pro He His 
65 70 75 

OCA ere Goc tca osa oca osr oct ere acc aag gaa tog cig ccc cca i969 

25 Pro Leu Ala Ser Arg Pro Arg Pro Leu Uir Lys Glu Trp Leu Pro Pro 
80 85 90 

AAC oca GOG GCr TOC CAT TIG 003 TOC 003 AOS TCT OCG OCG TOC AOC 2017 
Asn Pro Gly Ala Cys His Lai Ala Ser Pro Ihr Ser Pro Pro Ser Thr 
30 95 100 105 

cm GOS GAC GOC CM TftT CIA COT OCA GAC TTC TOG GAG TOG TIC ACT 2065 
Val Ala Asp Ala Gin lyr Leu His Pro A^ Phe Ser Glu Ser Phe Ttir 
110 115 120 125 

35 

OSA em OCA ooc oca gat eic gic toc tct ooc gac tog aca aac tog 2113 

Arg Leu Pro Pro Pro Asp Leu Val Ser Ser Pro Asp Ser Thr Asn Ser 
130 135 140 

40 em TTC GAC TOG TOC ACT ATC GQC GCA CTC OOC GOG OCA OGC OGT CIG 2161 
Leu Phe Asp Ser Ser Thr He Gly Ala Leu Pro Ala Pro Arg Arg Leu 
145 150 155 

TOG AOG OCA AAC CIT CIA GOC CAT GTC AAT GIC TTC CTC AAG TPC CTG 2209 
45 Ser Thr Pro Asn Leu Leu Ala His Val Asn Val Phe Leu Lys Tyr Leu 
160 165 170 

TIC OOG ATC ATO OOC GTC GTO AGA CM3 GAC CMS CIG CM5 CM GAC TOC 2257 
Phe Pro lie Met Pro Val Val Arg Gin Asp Gin Leu Gin dn Asp cys 
50 175 180 185 

CAC CfiG OOG GAG OGC TTG TCT OOC CAA 030 TAC GCT TTC ATT GOC GCT 2305 
His Gin Pro Glu Arg Leu Ser Pro Gin Arg lyr Ala Phe He Ala Ala 
190 195 200 205 

55 

CTA TOC GOG GOC AOG CAC ATC CAA CTO A«3 CIG GAC OGT GCA GCA OOG 2353 
Leu cys Ala Ala Hur His He Gin Leu Lys Leu Asp Gly Ala Ala Pro 
210 215 220 

60 GGT OOC GAG GOG GCT TOC GOG OGA GOC AGC CTC CaC GGA CAT OCT ATO 2401 
Gly Pro Glu Ala Ala Ser Ala Arg Ala Ser Leu Asp Gly His Pro Met 
225 230 235 



65 TTO TOG GGA GAA GAA CIC CCG GCT GAA GOC GIG OGC GCA AGA AW3 GAA 2449 
Leu Ser Gly Glu Glu Leu Leu Ala Glu Ala Val Arg Ala Arg Lys Glu 
240 245 250 

TAC AAC GIG GIC GAC GAA ATr AAC ATO GAA AAC CIC CIA AOC TOC TIC 2497 
Tyr Asn Val Val Asp Glu Ile Asn Met Glu Asn Leu Leu Thr Ser Phe
255 260 265 

TTT CIC TIC GCC GGC TAG QSV AAC CIAGRCAGACRGGArCRGGaCT3G 
5 Ite Leu Phe Ala Ala Tyr Gly Asn Leu Asp Arg Gin Asp Gin Ala Trp 
270 275 280 265 

TIC to: CIA Tcjr cpg acc acg tcx: ato gic ttc aca cia ggc cia caa 

Ite TVr Leu Cys Gin Tlir Thr Ser Met Val Phe TJir Leu Gly Gin 
10 290 295 300 

OQG GAA TGC ACA TAC TCX3 AAA CIA AGC GIC GftG GAA GCA GAA GAG AAA 
Arg Glu Ser Tbr lyr Ser Lys Leu Ser Val Glu Glu Ala Glu Glu lys 
305 310 315 

AGG AGA GrrA TIC TGG-CTC TIA TIC GIC ACA GAA AG GTAAGAAAAG 
Arg Arg Val Phe Trp Leu Leu Phe Val Thr Glu Arg 
320 325 

20 AAAAAACICr A CITIUUUA A TCAOCACXaC GIA0C3\AAAA TAACAOGAAA AAOCAG A 

GQC to: GCA TEA CAA CAA GCA AAA OCA GIC ATO CIC OOC AAC TOC AIC 
Gly Tyr Ala L^ Gin Gin Ala Lys Pro Val Met Leu Arg Asn Ser He 
330 335 340 345 

25 

CAC AAA CCA CAG GIC CIG TOC ICA GAC GAC OCA ATC CIA OOC TAC GGG 
His Lys Pro Gin Val Leu Cys Ser Asp Asp Pro lie I^ Ala Tyr Gly 
350 355 360 

30 TICATCAACCICATCAACGlCTICGAAAfiGCICAGCCCAAATCICT^ 
Phe lie Asn Leu He Asn Val E*e Glu Lys Leu Ser Pro Asn lau Tyr 
365 370 375 

GACTOGICT0CGCCGGCaaCAGCAGCGCAGACGCX:GACCOCCa5OCr 
35 Asp Trp Val Ser Ala Gly Gly Ser Ser Ala Asp Gly Asp Pro Pro Pro 
380 385 390 

ACTTCTTCTATCayVTCCAGrCICGOCAAGCSAATCTCrCICG^ 
Thr Ser Ser He Gin Ser Ser Leu Ala Lys Gin He Ser l£u Glu Gly 
40 395 400 405 

GICTOCGAGATCCAGAAAGmGACATCCrCATCACrCAGCAAT3GCIA 
Val Ser Glu He Gin Lys Val Aep He Leu He Ihr Gin Gin Trp leu 
410 415 420 425 

45 

CAAADCATOAIGTOGAAACICTOCAroAOCCACGICACACMOOCGQC 
Gin Ihr Met Met Trp Lys Leu Ser Met 13ir His Val Thr dn Pro Gly 
430 435 440 

50 TCT OGC GAT CSC GOC GFT CIC OOC TIC CP£: CTQ OOC GTS CIA GIC GQC 
Ser Arq Asp Asp Ala Val Leu Pro Phe His Leu Pro Val leu Val Gly 
445 450 455 

AAGGCrGrCAaGGaCGICA3CGaCGOSGC3^T0CCAAGGrGCrGrrGAC 
55 Lys Ala Val Met Gly Val He Ala Ala Ala Ser Gin Gly Ala Val Asp 
460 465 470 

GCr CAT GGT ATC GGA ATO GIAAGAAAGC GADCITOXT CATCAC3iOOC 
Ala His Gly He Gly Met 
60 475 

TOCCICAICA GICACraaOC ATCMCIMA CCOXSSKTCT AACAAAAAOC: GCRG GAA 

Glu 
480 



65 



GAA AAA CIC TAC GAC CTC GGC ACC TCC GIA OOC GAC GTC TOC CQC TOC 
Gin Lys Leu Tyr Asp Leu Gly Thr Ser Val Ala Asp Val Ser Arg Ser 
485 490 495 
cm A3C ACA AAA GaCG(rCACC3y::criCGCrGAATCX3ACrATCGACC^ 3376
Leu Ser Thr Lys Ala Ala His His Leu Ala Glu Ser Thr He Asp Pro 
500 505 510 

CGA GAA dC CIC T3G CX3C ATT CTC ACA AOC Cm TOC OSV ATC OQC OCT 3424 
Arg Glu Leu Leu Trp Gly He Leu Thr Ihr Leu Ser Ara He Arg Gly 
515 520 525 

TCC CPA TCA m: CIC TIC OCA CXX3 CIC arc gag CPA ACT OGA GGC AIC 3472 
Ser Gin Ser lyr Leu Phe Pro Ala Leu Val Glu Gin Ser Arg Gly lie 
530 535 540 

ATC ACT TIC GAC TCT TOS CTT TOC ATC ACT GAC TIT CIG OCT TOG TIT 3520 
He Ser Phe Asp Cys Ser Leu Ser He Ser Asp Ete Leu Pro Ser Phe 
545 550 555 560 

OCT GGG CCG OOS GCT ATT ATC TOG OGG AD5 GCT GAA TCT QCXj TIT GAT 3568 
Gly Gly Pro Pro Ala He Met Trp Arg Ihr Gly Glu Ser Gly Phe Asp 
565 570 575 

Tm TIG GOG ATC GOG GAT GAT TIG CAA GAG AGG GAG AAT GM3 OCT GOG 3616 
leu Leu Gly He Ala Asp Asp Leu Gin Glu Arg Glu Asn Glu Gly Gly 
580 585 590 

GAG GOG ATT GIG GIG GCT GGG GAG GAG ATT TOG TIT TGAGQQaQCT 3662 
Glu Gly He Val Val Ala Gly Glu Glu He Ser Pte 
595 600 

CrmCITIT TLUlTIUlUG TGTOITGICT TOGGIGCTIC TSQQQQQGOG aOGGIGmm 3722 

T?M33CnnAC GATCTCX3OT GGGAnGGGG TTOCDOOG TMSmyOAT GGA ' X ' lG ' iTXT 3782 

CnjOMSGIC OSCiutiflGAC OGIGCAATGA TCTQOQGATC AATCACTTCT T3^3GACra3G 3842 

AGC3tfaG3CT GnOGCTIClC GGCTlSmcr GACTaaX3AGA TMKmGAA TCftCTBVKEG 3902 

ATCRTEATIG mMAOnT AAAGAAMAT AIGCrTOQCA OOOOGSVIAIG ACAAiaGAAA 3962 

ACIGGTCTTC ArnnSGA 3980 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Ser His Ser Pro Thr Asp Ile Pro Ser Thr Ser Glu Lys Glu Met
1 5 10 15

Glu Ser Thr Pro Glu Lys Pro Pro Lys Gln Ala Cys Asp Asn Cys Arg
20 25 30

Arg Arg Lys Ile Lys Cys Ser Arg Glu Leu Pro Cys Asp Lys Cys Gln
35 40 45

Arg Leu Leu Leu Ser Cys Ser Tyr Ser Asp Val Leu Arg Arg Lys Gly
50 55 60

Pro Lys Phe Arg Thr Leu Tyr Pro Leu Ala Pro Ile His Pro Leu Ala
65 70 75 80

Ser Arg Pro Arg Pro Leu Thr Lys Glu Trp Leu Pro Pro Asn Pro Gly
85 90 95

Ala Cys His Leu Ala Ser Pro Thr Ser Pro Pro Ser Thr Val Ala Asp
100 105 110

Ala Gln Tyr Leu His Pro Asp Phe Ser Glu Ser Phe Thr Arg Leu Pro
115 120 125

Pro Pro Asp Leu Val Ser Ser Pro Asp Ser Thr Asn Ser Leu Phe Asp
130 135 140

Ser Ser Thr Ile Gly Ala Leu Pro Ala Pro Arg Arg Leu Ser Thr Pro
145 150 155 160

Asn Leu Leu Ala His Val Asn Val Phe Leu Lys Tyr Leu Phe Pro Ile
165 170 175

Met Pro Val Val Arg Gln Asp Gln Leu Gln Gln Asp Cys His Gln Pro
180 185 190

Glu Arg Leu Ser Pro Gln Arg Tyr Ala Phe Ile Ala Ala Leu Cys Ala
195 200 205

Ala Thr His Ile Gln Leu Lys Leu Asp Gly Ala Ala Pro Gly Pro Glu
210 215 220

Ala Ala Ser Ala Arg Ala Ser Leu Asp Gly His Pro Met Leu Ser Gly
225 230 235 240

Glu Glu Leu Leu Ala Glu Ala Val Arg Ala Arg Lys Glu Tyr Asn Val
245 250 255

Val Asp Glu Ile Asn Met Glu Asn Leu Leu Thr Ser Phe Phe Leu Phe
260 265 270

Ala Ala Tyr Gly Asn Leu Asp Arg Gln Asp Gln Ala Trp Phe Tyr Leu
275 280 285

Cys Gln Thr Thr Ser Met Val Phe Thr Leu Gly Leu Gln Arg Glu Ser
290 295 300

Thr Tyr Ser Lys Leu Ser Val Glu Glu Ala Glu Glu Lys Arg Arg Val
305 310 315 320

Phe Trp Leu Leu Phe Val Thr Glu Arg Gly Tyr Ala Leu Gln Gln Ala
325 330 335

Lys Pro Val Met Leu Arg Asn Ser Ile His Lys Pro Gln Val Leu Cys
340 345 350

Ser Asp Asp Pro Ile Leu Ala Tyr Gly Phe Ile Asn Leu Ile Asn Val
355 360 365

Phe Glu Lys Leu Ser Pro Asn Leu Tyr Asp Trp Val Ser Ala Gly Gly
370 375 380

Ser Ser Ala Asp Gly Asp Pro Pro Pro Thr Ser Ser Ile Gln Ser Ser
385 390 395 400

Leu Ala Lys Gln Ile Ser Leu Glu Gly Val Ser Glu Ile Gln Lys Val
405 410 415

Asp Ile Leu Ile Thr Gln Gln Trp Leu Gln Thr Met Met Trp Lys Leu
420 425 430

Ser Met Thr His Val Thr Gln Pro Gly Ser Arg Asp Asp Ala Val Leu
435 440 445

Pro Phe His Leu Pro Val Leu Val Gly Lys Ala Val Met Gly Val Ile
450 455 460
Ala Ala Ala Ser Gln Gly Ala Val Asp Ala His Gly Ile Gly Met Glu
465 470 475 480

Gln Lys Leu Tyr Asp Leu Gly Thr Ser Val Ala Asp Val Ser Arg Ser
485 490 495

Leu Ser Thr Lys Ala Ala His His Leu Ala Glu Ser Thr Ile Asp Pro
500 505 510

Arg Glu Leu Leu Trp Gly Ile Leu Thr Thr Leu Ser Arg Ile Arg Gly
515 520 525

Ser Gln Ser Tyr Leu Phe Pro Ala Leu Val Glu Gln Ser Arg Gly Ile
530 535 540

Ile Ser Phe Asp Cys Ser Leu Ser Ile Ser Asp Phe Leu Pro Ser Phe
545 550 555 560

Gly Gly Pro Pro Ala Ile Met Trp Arg Thr Gly Glu Ser Gly Phe Asp
565 570 575

Leu Leu Gly Ile Ala Asp Asp Leu Gln Glu Arg Glu Asn Glu Gly Gly
580 585 590

Glu Gly Ile Val Val Ala Gly Glu Glu Ile Ser Phe
595 600
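The listing for SEQ ID NO: 3 uses three-letter residue codes with position numbers printed on alternate lines. As an illustration only, the sketch below converts such a block to the one-letter code, which is convenient for length checks and database searches. The mapping is the standard amino acid code table; the function name `to_one_letter` is hypothetical, and the example input is the first row of the listing.

```python
# Standard three-letter to one-letter amino acid codes (20 common residues).
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(listing):
    """Convert a three-letter residue listing to one-letter code.

    Purely numeric tokens (the position markers) are skipped; an
    unrecognised residue token raises KeyError so that OCR damage is
    noticed rather than silently dropped.
    """
    tokens = [tok for tok in listing.split() if not tok.isdigit()]
    return "".join(THREE_TO_ONE[tok] for tok in tokens)


first_row = """Met Ser His Ser Pro Thr Asp Ile Pro Ser Thr Ser Glu Lys Glu Met
1 5 10 15"""
print(to_one_letter(first_row))
```

Raising on unknown tokens is a deliberate choice here: misread residues such as "lie" or "Gin" would otherwise pass unnoticed.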



(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: primer 4650
(iii) HYPOTHETICAL: YES
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:
CTTGCATGCC GCCAGGACCG AGCAAG 26



(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 26 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: primer 4651
(iii) HYPOTHETICAL: YES
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:
CTTGGATCCT CTGTGTTAGC TTATAG 26

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 30 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: primer
(iii) HYPOTHETICAL: YES
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:
CCCCAAGCTT CGCCGTCTGC GCTGCTGCCG 30



(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: primer
(iii) HYPOTHETICAL: YES
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:
CGGAATTCAT CAACCTCATC AACGTCTTC 29



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 29 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: primer
(iii) HYPOTHETICAL: YES
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:
CGGAATTCAT CGGCGAGATA GTATCCTAT 29



(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 41 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: primer
(iii) HYPOTHETICAL: YES
(iv) ANTI-SENSE: NO

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
TTTTGTAAGC TTTTTTTTTT TTTTTTTTTT TTTTTTTTTT T 41
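The primer entries above each declare a length alongside the sequence, so a simple consistency check is possible. The sketch below does that for SEQ ID NOs 4 to 8 and also scans each primer for four common restriction recognition sequences (SphI GCATGC, BamHI GGATCC, HindIII AAGCTT, EcoRI GAATTC). The choice of enzymes is an assumption made for illustration; this part of the listing does not itself state which sites the primers were designed to carry. The names `PRIMERS`, `SITES` and `check_primer` are hypothetical.

```python
# Primer text and declared length, as printed in the sequence listing.
PRIMERS = {
    4: ("CTTGCATGCC GCCAGGACCG AGCAAG", 26),
    5: ("CTTGGATCCT CTGTGTTAGC TTATAG", 26),
    6: ("CCCCAAGCTT CGCCGTCTGC GCTGCTGCCG", 30),
    7: ("CGGAATTCAT CAACCTCATC AACGTCTTC", 29),
    8: ("CGGAATTCAT CGGCGAGATA GTATCCTAT", 29),
}

# Recognition sequences of four widely used restriction enzymes
# (which enzymes to look for is an assumption of this sketch).
SITES = {
    "SphI": "GCATGC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
    "EcoRI": "GAATTC",
}


def check_primer(text, declared_length):
    """Return (length_matches, {enzyme: 0-based offset of first site})."""
    seq = text.replace(" ", "").upper()
    found = {name: seq.find(site) for name, site in SITES.items() if site in seq}
    return len(seq) == declared_length, found


for seq_id, (text, length) in sorted(PRIMERS.items()):
    print(seq_id, check_primer(text, length))
```

A mismatch between the counted and declared length would point to a transcription error in the listing rather than in the primer itself.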
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 11, line 11-13

B. IDENTIFICATION OF DEPOSIT

Further deposits are identified on an additional sheet [ ]

Name of depositary institution

DEUTSCHE SAMMLUNG VON MIKROORGANISMEN UND ZELLKULTUREN GmbH

Address of depositary institution (including postal code and country)

Mascheroder Weg 1b, D-38124 Braunschweig, GERMANY

Date of deposit

1996-05-10

Accession Number

DSM 10671

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [ ]

Until the publication of the mention of grant of a European patent or, where
applicable, for twenty years from the date of filing if the application has been
refused, withdrawn or deemed withdrawn, a sample of the deposited microorganism is
only to be provided to an independent expert nominated by the person requesting
the sample (cf. Rule 28(4) EPC). And as far as Australia is concerned, the expert
option is likewise requested, reference being had to Regulation 3.25 of Australia
Statutory Rules 1991 No 71. Also, for Canada we request that only an independent
expert nominated by the Commissioner is authorized to have access to a sample of

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession
Number of Deposit")

For receiving Office use only

[X] This sheet was received with the international application

Authorized officer

For International Bureau use only

[ ] This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)
INDICATIONS RELATING TO A DEPOSITED MICROORGANISM

(PCT Rule 13bis)

A. The indications made below relate to the microorganism referred to in the description
on page 11, line 31-34

B. IDENTIFICATION OF DEPOSIT

Further deposits are identified on an additional sheet [ ]

Name of depositary institution

DEUTSCHE SAMMLUNG VON MIKROORGANISMEN UND ZELLKULTUREN GmbH

Address of depositary institution (including postal code and country)

Mascheroder Weg 1b, D-38124 Braunschweig, GERMANY

Date of deposit

1996-05-10

Accession Number

DSM 10666

C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [ ]

Until the publication of the mention of grant of a European patent or, where
applicable, for twenty years from the date of filing if the application has been
refused, withdrawn or deemed withdrawn, a sample of the deposited microorganism is
only to be provided to an independent expert nominated by the person requesting
the sample (cf. Rule 28(4) EPC). And as far as Australia is concerned, the expert
option is likewise requested, reference being had to Regulation 3.25 of Australia
Statutory Rules 1991 No 71. Also, for Canada we request that only an independent
expert nominated by the Commissioner is authorized to have access to a sample of

D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States)

E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable)

The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession
Number of Deposit")

For receiving Office use only

[X] This sheet was received with the international application

Authorized officer

For International Bureau use only

[ ] This sheet was received by the International Bureau on:

Authorized officer

Form PCT/RO/134 (July 1992)
CLAIMS

1. A transcription factor regulating the expression of an
α-amylase promoter in filamentous fungus.

2. The factor of claim 1 originating from a fungus of the
genus Aspergillus, Trichoderma, Penicillium, Fusarium, Humicola,
etc.

3. The factor of claim 2 originating from the species A.
oryzae, A. niger, A. awamori, especially A. oryzae IFO4177.

4. The factor of claim 3 having an amino acid sequence
comprising one or more fragments of the amino acid sequence
depicted as SEQ ID NO: 3.

5. A DNA construct having a DNA sequence coding for the
factor of any of the claims 1 to 4.

6. The DNA sequence of claim 5 having a DNA sequence
comprising one fragment or a combination of fragments of the
DNA sequence depicted as SEQ ID NO: 1.

7. A DNA construct comprising a DNA sequence encoding a
transcription factor exhibiting activity in regulating the
expression of an α-amylase promoter in a filamentous fungus,
which DNA sequence comprises

a) the transcription factor encoding part of the DNA sequence
cloned into plasmid pToC320 present in E. coli ToC1058, DSM
10666, or

b) an analogue of the DNA sequence defined in a), which

i) is at least 60% homologous with the DNA sequence
defined in a), or

ii) hybridizes with the same nucleotide probe as the
DNA sequence defined in a), or

iii) encodes a transcription factor which is at least
50% homologous with the transcription factor
encoded by a DNA sequence comprising the DNA
sequence defined in a), or

iv) encodes a transcription factor which is
immunologically reactive with an antibody raised
against the purified transcription factor encoded
by the DNA sequence defined in a), or

v) complements the mutation in ToC879, i.e. makes
ToC879 able to grow on cyclodextrin and produce
lipase when transformed with said DNA sequence.

8. The DNA construct according to any of the claims 5 to 7,
in which said DNA sequence is obtainable from a filamentous
fungus.

9. The DNA construct according to claim 8, in which said
filamentous fungus belongs to any of the groups Phycomycetes,
Zygomycetes, Ascomycetes, Basidiomycetes and Fungi Imperfecti,
including Hyphomycetes such as the genera Aspergillus,
Penicillium, Trichoderma, Fusarium and Humicola, in particular a
strain from Aspergillus sp., and especially from A. oryzae.

10. The DNA construct according to claim 9, in which said DNA
sequence is isolated from or produced on the basis of a DNA
library of an Aspergillus oryzae strain.

11. The DNA construct according to claim 5 to 8, in which
said DNA sequence is obtainable from a yeast strain, especially
of Saccharomyces.

12. The DNA construct according to claim 1, in which the DNA
sequence is isolated from Escherichia coli DSM 10666.

13. A recombinant expression vector comprising a DNA
construct according to any of claims 5 to 12.


14. A cell comprising a DNA construct according to any of
claims 5 to 12, or a recombinant expression vector according to
claim 13.

15. The cell according to claim 14, which is a eukaryotic
cell, in particular a fungal cell, such as a yeast cell or a
filamentous fungal cell.

16. The cell according to claim 15, which is a strain of
Aspergillus sp., in particular a strain of A. niger or A.

17. The cell according to claim 15, which is a strain of
Trichoderma sp., in particular T. reesei.

18. The cell according to claim 15, which is a strain of
Saccharomyces, in particular a strain of S. cerevisiae.

19. A method of producing a polypeptide of interest comprising
growing a cell of any of the claims 14 to 18 under
conditions conducive to the production of said factor and said
polypeptide of interest, and recovering said polypeptide of
interest.

20. The method of claim 19, wherein said fungus is a fungus
of the genus Aspergillus, Trichoderma, Penicillium, Fusarium or
Humicola.

21. The method of claim 20, wherein said cell is of the 
species A. oryzae, A. niger, or A. awamori. 

22. The method of claim 19, 20, or 21, wherein said 
polypeptide of interest is a medicinal polypeptide. 

23. The method of claim 22, wherein said medicinal polypeptide
is a growth hormone, insulin, or a blood clotting factor.

24. The method of claim 19, 20, or 21, wherein said 
polypeptide is an industrial enzyme. 
25. The method of claim 24, wherein said industrial enzyme is
a carbonyl hydrolase, carbohydrase, protease, lipase, amylase,
cellulase, oxido reductase, glucoamylase, or esterase.

26. Use of a factor of any of the claims 1 to 4 for enhancing
the expression of a polypeptide of interest in a filamentous
fungus.

27. The use of claim 26, wherein said factor is the factor of
claim 4.

28. The use of claim 27, wherein said fungus is a fungus of
the genus Aspergillus, Penicillium, Trichoderma, Fusarium and
Humicola, in particular a strain from Aspergillus sp., and
especially from A. oryzae sp.

29. The use of claim 28, wherein said fungus is of the
species A. oryzae, A. niger, A. awamori, T. reesei, or T.
harzianum.



30. The use of any of the claims 26 to 29, wherein said 
polypeptide of interest is a medicinal polypeptide. 

31. The use of claim 30, wherein said medicinal polypeptide 
is a growth hormone, insulin, or blood clotting factor. 

32. The use of any of the claims 26 to 29, wherein said 
polypeptide is an industrial enzyme. 

33. The use of claim 32, wherein said industrial enzyme is a 
carbonyl hydrolase, carbohydrase, protease, lipase, amylase, 
cellulase, oxido reductase, glucoamylase, or esterase. 
Fig. 2

Fig. 3